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The present invention 
is a substantially purified 
sortase-transamidase enzyme from 
Gram-positive bacteria, such as 
Staphylococcus aureus. The enzyme 
having a molecular weight of about 
41,000 daltons and catalyzing a 
reaction that covalently cross-links 
the carboxyl terminus of a protein 
having a sorting signal to the 
peptidoglycan of a Gram-positive 
bacterium, the sorting signal having: 
(1) a motif of LPX3X4G therein; (2) 
a substantially hydrophobic domain 
of at least 31 amino acids carboxyl 
to the motif; and (3) a charged tail 
region with at least two positively 
charged residues carboxyl to the 
substantially hydrophobic domain, 
at least one of the two positively 
charged residues being arginine, 
the two positively charged residues 
being located at residues 31-33 
from the motif, wherein X3 is any 
of the twenty naturally-occurring 
I^amino acids and X4 is selected 
from the group consisting of alanine, 
serine, and threonine, and wherein 
sorting occurs by cleavage between 
the fourth and fifth residues of the 

LPX3X4G motif. Variants of the enzyme, methods for cloning 
of use of the enzyme, including for screening for antibiotics 
bacteria, are also disclosed. 



Matching Percentage (Total Window: 21%, Alignment Window: 22%) 

S.aureus 



yq — 43 



DPK L K — EI 

II I II I 

S.pyogenes 1 mi^ysqklpkepamsgfleqiuxshclrqmaekgleallvthltnsyyltg 



50 



44 



51 
94 
101 
144 



151 



194 



201 



244 



251 
294 



1 VLE — SQM KAINE 1 R-P 93 

I Ml til 1 II 

FSGTAATVLITAKRRVLITDSRYTLIAKAS VEGFDI IESRTPLKWAELL 100 

G MT G A E A 143 

I I I I I I 

EADQI DC LG FEDQVS FS FYQAMQAELSGI TLLAQSGFVEHLRL I KDASEI 150 

D — A IS R NYLE SK 193 

I I II III I 

DTIAKACSISDKAFEDALDFIKPGTTTERDLANFLDFRMRQYGASGTSFD 200 

GY G K E FG H — SLG H-G- 243 

II II I II I I II 

IIVASGYLSAMPHGRASDKVIQNKESLTMDFGCYYNHYVSDMTRTIHIGQ 250 

IG-LE IHE G PM-L A R-T 293 

II I I III II 

VTDEEREIYALVXAANKALIAKASAGMTYSDFDGIPRQLITEAGYGSRFT 300 



— IQ d K L-Q VNNCVTVBPGVYIEGLG GI 

I I I III II III I I I 



34 3 



301 HGIGHGIGLDIHENPFFGKSEQLLQAGMV VTDEPGIY LDNKYGV 350 



344 



351 



RIEDDIL-I TENGCQVFTKCTKDLI VLT * 393 

Mill I M (111 I I MM I 

RIEDD-LVITKTGCQVLTIAPKELI VL- * 400 

(SEQ ID No: 2 & 34) 

the gene encoding the enzyme and expressing the cloned gene, and methods 
and for display of proteins or peptides on the surfaces of Gram-positive 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


SZ 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


UZ 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


CI 


C6te d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 99/09145 



PCT/US98/16229 



-1- 

IDENTIFICATION OF SORTASE GENE 

GOVERNMENT RIGHTS 

This invention was supported by grants from the United States 
government, namely grants from the National Institutes of Health, NIH-NI AID Grant 
Nos. AI 33985 and 38897. Accordingly, the government may have certain rights in 
this invention. 

5 

BACKGROUND OF THE INVENTION 
This invention is directed to an enzyme from Gram-positive bacteria, 
designated sortase-transamidase, nucleic acid segments encoding the enzyme, and 
methods of use of the enzyme. 

10 Human infections caused by Gram-positive bacteria present a medical 

challenge due to the dramatic increase in multiple antibiotic resistance strains in 
recent years. Gram-positive bacteria that can cause serious or fatal infections in 
humans include Staphylococcus , Streptococcus, Enter ococcus, Pneumococcus, 
Bacillus, Actinomyces, Mycobacterium, and Listeria, as well as others. Infections 

1 5 caused by these pathogens are particularly severe and difficult to treat in 

immunologically compromised patients. These include patients suffering from 
infection with the Human Immunodeficiency Virus (HIV), the virus that causes AIDS, 
as well as patients given immune-suppressive agents for treatment of cancer or 
autoimmune diseases. In particular, infections caused by various Mycobacterium 

20 species, including M. tuberculosis, M. bovis, M. avium, and M. intracellular e, are 
frequently the cause of disease in patients with AIDS. 

Therefore, it is apparent that new target sites for bacterial 
chemotherapy are needed if such pathogenic organisms are to be controlled. 

A unique characteristic of these pathogens and many Gram-positive 

25 bacteria is their surface display of proteins anchored to the cell wall. In fact, many of 
these molecules are known to be involved in essential cellular functions, including 
pathogenesis in a susceptible host. Thus, a possible disruption in this anchoring 
process may prove to be an effective treatment against these disease-causing 
elements. 

30 The anchoring of surface molecules to the cell wall in Gram-positive 

bacteria has been demonstrated to involve a conserved pathway, culminating in 
recognition of a conserved cleavage/anchoring site by some previously 
uncharacterized cellular machinery. Molecules whose ultimate location is the cell 
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wall must invariably be translocated across the single cellular membrane of these 
organisms. This is mediated for all cell wall anchored proteins by the well studied 
secretory pathway, involving cleavage of an amino-terminal signal peptide by a type I 
signal peptidase. Upon translocation of the molecule out of the cytoplasm, a 
5 mechanism must be present that extracellularly recognizes this protein as a substrate 
for anchoring. This process has been previously shown to involve the carboxyl- 
terminally located cell wall sorting signal, consisting of a highly conserved motif such 
as LPXTG (SEQ ID NO: 1), in which X can represent any of the twenty naturally 
occurring L-amino acids, followed by a series of hydrophobic residues and ultimately 

10 a sequence of positively-charged residues. Thus, once amino-terminally modified 
and successfully secreted, a polypeptide with this carboxyl-terminal sequence can 
present itself as a substrate to be processed by the anchoring machinery. At this time, 
cleavage of the sorting signal after the threonine residue is coupled with covalent 
linkage of the remainder of the polypeptide to the free amino group of the 

15 pentaglycine crossbridge in the cell wall. 

It is this transpeptidation reaction that anchors mature surface proteins 
so that the peptidoglycan layer, from which point the molecules can serve their 
biological functions. Therefore, there is a need to isolate and purify the enzyme that 
catalyzes this reaction. There is also a need to identify the gene encoding such an 

20 enzyme in order that the enzyme can be produced by genetic engineering techniques. 

Additionally, there is also a need to develop new methods for 
displaying proteins or peptides on the surfaces of bacteria. For many purposes, it is 
desirable to display proteins or peptides on the surfaces of bacteria so that the proteins 
or peptides are accessible to the surrounding solution, and can, for example, be bound 

25 by a ligand that is bound specifically by the protein or peptide. In particular, the 
display of proteins on the surface of bacteria is desirable for the preparation of 
vaccines, the linkage of molecules such as antibiotic molecules or diagnostic reagents 
to cells, for screening reagents such as monoclonal antibodies, and for the selection of 
cloned proteins by displaying the cloned proteins, then observing their reaction with 

30 specific reagents such as antibodies. One way of doing this has been with phage 
display (G.P. Smith, "Filamentous Fusion Phage: Novel Expression Vectors that 
Display Cloned Antigens on the Virion Surface," Science 228:1315-1316 (1985)). 
However, phage display is limited in its practicality, because it requires that the 
protein being displayed to be inserted into a coat protein of filamentous phage and 

35 retain its activity while not distorting the conformation of the coat protein, allowing 
functional virions to be formed. In general, this technique is therefore limited only to 
small peptide and proteins. 
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Therefore, there is a need for a more general method of peptide and 
protein display. 

SUMMARY 

5 The present invention is directed to sortase-transamidase enzymes 

from Gram-positive bacteria, particularly Staphylococcus aureus, and methods for 
their use, particularly in the areas of drug screening and peptide and protein display. 

One aspect of the present invention is a substantially purified sortase- 
transamidase enzyme from a Gram-positive bacterium, the enzyme catalyzing a 

10 reaction that covalently cross-links the carboxyl terminus of a protein having a sorting 
signal to the peptidoglycan of a Gram-positive bacterium, the sorting signal having a 
motif of LPX3X4G therein, wherein sorting occurs by cleavage between the fourth and 
fifth residues of the LPX3X4G motif. Typically, the Gram-positive bacterium is a 
species selected from the group consisting of Staphylococcus aureus, S. sobrinus, 

15 Enterococcus faecalis, Streptococcus pyogenes, and Listeria monocytogenes. 
Preferably, the Gram-positive bacterium is S. aureus. The enzyme may be a 
heterooligomer. 

Preferably, the enzyme has at least one subunit with a molecular weight 
of about 41,000 daltons and the sorting signal further includes: (2) a substantially 

20 hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and (3) a 
charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X 3 is any of the twenty naturally-occurring L-amino acids 

25 and X4 is selected from the group consisting of alanine, serine, and threonine. 

Preferably, the enzyme includes therein a subunit whose amino acid 
sequence is selected from the group consisting of: (1) D-P-K-L-K-E-I-Y-Q-I-V- 
L-E-S-Q-M-K-A-I-N-E-I-R-P-G-M-T-G-A-E-A-D-A-I-S-R-N-Y-L- 
E-S-K-G-Y-G-K-E-F-G-H-S-L-G-H-G-I-G-L-E-I-H-E-G-P-M-L-A- 

30 R-T-I-Q-D-K-L-Q-V-N-N-C-V-T-V-E-P-G-V-Y-I-E-G-L-G-I-R-I-E- 
D-D-I-L-I-T-E-N-G-C-Q-V-F-T-K-C-T-K-D-L-I-V-L-T (SEQ ID NO: 
2); (2) M-V-K-V-T-D-Y-S-N-S-K-L-G-K-E-I-A-P-E-V-L-S-V-I-A-S- 
I-A-T-S-E-V-E-G-I-T-G-H-F-A-E-L-K-E-T-N-L-E-K-V-S-R-K-N-L- 
S-R-D-L-K-I-E-S-K-E-G-I-Y-I-D-V-Y-C-A-L-K-H-G-V-N-I-S-K-T- 

35 A-N-K-I-Q-T-S-I-F-N-S-I-S-N-M-T-A-I-E-P-K-Q-I-N-I-H-I-T-Q-I- 
V-I-E-K (SEQ ID NO: 31) and (3) sequences incorporating one or more 
conservative amino acid substitutions in SEQ ID NO:2 or SEQ ID NO: 31, wherein 
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the conservative amino acid substitutions are any of the following: (1) any of 
isoleucine, leucine, and valine for any other of these amino acids; (2) aspartic acid for 
glutamic acid and vice versa; (3) glutamine for asparagine and vice versa; and (4) 
serine for threonine and vice versa. 
5 Another aspect of the present invention is a nucleic acid sequence 

encoding this enzyme. In one alternative, the nucleic acid sequence includes therein a 
sequence selected from the group consisting of: (1) 

GATCCTAAACTGAAAGAAATATATCAAATAGTACTTGAATCTCAAATGAA 

AGCAATTAATGAGATTAGACCTGGCATGACTGGTGCAGAAGCTGATGCCA 

10 TTTCAAGAAACTATTTAGAGTCAAAAGGGTATGGAAAAGAATTTGGACAT 

TCACTAGGACATGGTATTGGTTTAGAAATCCATGAAGGGCCAATGCTGGC 

TCGTACGATACAAGATAAACTTCAAGTTAACAACTGTGTTACAGTAGAAC 

CTGGTGTTTATATAGAAGGTTTGGGCGGTATAAGAATAGAAGATGATATT 

TTAATTACAGAAAATGGTTGTCAAGTCTTTACTAAATGCACAAAAGACCTT 
15 ATAGTTTTAACATAA (SEQ ID NO: 28); (2) 

ATGGTCAAAGTAACTGATTATTCAAATTCAAAATTAGGTAAAGTAGAAAT 
AGCGCCAGAAGTGCTATCTGTTATTGCAAGTATAGCTACTTCGGAAGTCG 
AAGGCATCACTGGCCATTTTGCTGAATTAAAAGAAACAAATTTAGAAAAA 
GTTAGTCGTAAAAATTTAAGCCGTGATTTAAAAATCGAGAGTAAAGAAGA 

20 TGGCATATATATAGATGTATATTGTGCATTAAAACATGGTAATATTTCAAA 
AACTGCAAACAAAATTCAAACGTCAATTTTTAATTCAATTTCTAATATGAC 
AGCGATAGAACCTAAGCAAATTAATATTCACATTACACAAATCGTTATTG 
AAAAGTAA (SEQ ID NO: 30); and (3) a sequence complementary to SEQ ID NO: 
28 or SEQ ID NO: 30. In another alternative, the nucleic acid sequence can include a 

25 sequence hybridizing with SEQ ID NO: 28, SEQ ID NO: 30 or a sequence 

complementary to SEQ ID NO: 28 or SEQ ID NO: 30 with no greater than about a 
15% mismatch under stringent conditions. Preferably, the degree of mismatch is less 
than about 5%; more preferably, the degree of mismatch is less than about 2%. 

Yet another aspect of the present invention is a vector comprising the 

30 nucleic acid sequence of the present invention operatively linked to at least one 
control sequence that controls the expression or regulation of the nucleic acid 
sequence. 

Yet another aspect of the present invention is a host cell transfected 
with a vector of the present invention. 
35 Another aspect of the present invention is a method for producing a 

substantially purified sortase-transamidase enzyme. The method comprises the steps 
of: 



WO 99/09145 



PCT/US98/16229 



-5- 

( 1 ) culturing a host cell according to the present invention under 
conditions in which the host cell expresses the encoded sortase-transamidase enzyme; 
and 

(2) purifying the expressed enzyme to produce substantially purified 
5 sortase-transamidase enzyme. 

Another aspect of the present invention is a method for screening a 
compound for anti-sortase-transamidase activity. This method is important in 
providing a way to screen for antibiotics that disrupt the sorting reaction and are likely 
to be effective in treating infections caused by Gram-positive bacteria. 
10 In one alternative, the screening method comprises the steps of: 

(1) providing a substantially purified sortase-transamidase enzyme 
according to the present invention; 

(2) performing an assay for sortase-transamidase in the presence and in 
the absence of the compound; and 

15 (3) comparing the activity of the sortase-transamidase enzyme in the 

presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

In another alternative, the screening method comprises the steps of: 

(1) providing an active fraction of sortase-transamidase enzyme from 
20 a Gram-positive bacterium; 

(2) performing an assay for sortase-transamidase in the presence and in 
the absence of the compound; and 

(3) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 

25 transamidase activity. 

The active fraction of sortase-transamidase activity can be a particulate 
fraction from Staphylococcus aureus. 

The assay for sortase-transamidase enzyme can be performed by 
monitoring the capture of a soluble peptide that is a substrate for the enzyme by its 

30 interaction with an affinity resin. In one alternative, the soluble peptide includes a 
sequence of at least six histidine residues and the affinity resin contains nickel. In 
another alternative, the soluble peptide includes the active site of glutathione S- 
transferase and the affinity resin contains glutathione. In yet another alternative, the 
soluble peptide includes the active site of streptavidin and the affinity resin contains 

35 biotin. In still another alternative, the soluble peptide includes the active site of 
maltose binding protein and the affinity resin contains amylose. 
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Still another aspect of the present invention is an antibody specifically 
binding the sortase-transamidase enzyme of the present invention. 

Yet another aspect of the present invention is a protein molecule 
comprising a substantially purified sortase-transamidase enzyme according to the 
5 present invention extended at its carboxyl-terminus with a sufficient number of 
histidine residues to allow specific binding of the protein molecule to a nickel- 
sepharose column through the histidine residues added at the carboxyl-terminus. 

Still another aspect of the present invention is a method for displaying 
a polypeptide on the surface of a Gram-positive bacterium comprising the steps of: 
10 (1) expressing a polypeptide having a sorting signal at its carboxy- 

terminal end, the sorting signal having: (a) a motif of LPX3X4G therein; (b) a 
substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(c) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
15 being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X 3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 

(2) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) a substantially purified sortase-transamidase according to the present 

20 invention; and (iii) a Gram-positive bacterium having a peptidoglycan to which the 
sortase-transamidase can link the polypeptide; and 

(3) allowing the sortase-transamidase to catalyze a reaction that 
cleaves the polypeptide within the LPX3X4 motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 

25 peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 

Another display method according to the present invention comprises: 

(1) cloning a nucleic acid segment encoding a chimeric protein into a 
Gram-positive bacterium to generate a cloned chimeric protein including therein a 

30 carboxyl-terminal sorting signal as described above; 

(2) growing the bacterium into which the nucleic acid segment has 
been cloned to express the cloned chimeric protein to generate a chimeric protein 
including therein a carboxyl-terminal sorting signal; and 

(3) binding the polypeptide covalently to the cell wall by the enzymatic 
35 action of a sortase-transamidase expressed by the Gram-positive bacterium involving 

cleavage of the chimeric protein within the LPX3X4G motif so that the polypeptide is 
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displayed on the surface of the Gram-positive bacterium in such a way that the 
polypeptide is accessible to a ligand. 

Another aspect of the present invention is a polypeptide displayed on 
the surface of a Gram-positive bacterium by covalent linkage of an amino-acid 
5 sequence of LPX3X4 derived from cleavage of an LPX3X4G motif, wherein X 3 is any 
of the twenty naturally-occurring L-amino acids and X4 is selected from the group 
consisting of alanine, serine, and threonine, the polypeptide being displayed on the 
surface of the Gram-positive bacterium in such a way that the polypeptide is 
accessible to a ligand. 

10 Another aspect of the present invention is a covalent complex 

comprising: 

(1) the displayed polypeptide; and 

(2) an antigen or hapten covalently cross-linked to the polypeptide. 
Yet another aspect of the present invention is a method for vaccination 

15 of an animal comprising the step of immunizing the animal with the displayed 

polypeptide to generate an immune response against the displayed polypeptide, or, 
alternatively, with the covalent complex to generate an immune response against the 
antigen or the hapten. 

Still another aspect of the present invention is a method for screening 

20 for expression of a cloned polypeptide comprising the steps of: 

(1) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end as described above; 

(2) forming a reaction mixture including: (i) the expressed chimeric 
protein; (ii) a substantially purified sortase-transamidase en2yme according to the 

25 present invention; and (iii) a Gram-positive bacterium having a peptidoglycan to 
which the sortase-transamidase can link the polypeptide through the sorting signal; 

(3) binding the chimeric protein covalently to the cell wall by the 
enzymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 

30 that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

(4) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

35 Still another aspect of the present invention is a method for the 

diagnosis or treatment of a bacterial infection caused by a Gram-positive bacterium 
comprising the steps of: 
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(1) conjugating an antibiotic or a detection reagent to a protein 
including therein a carboxyl-terminal sorting signal as described above to produee a 
conjugate; and 

(2) introducing the conjugate to an organism infected with a Gram- 

5 positive bacterium in order to cause the conjugate to be sorted and covalently cross- 
linked to the cell walls of the bacterium in order to treat or diagnose the infection. 

If an antibiotic is used, typically it is a penicillin, ampicillin, 
vancomycin, gentamicin, streptomycin, a cephalosporin, amikacin, kanamycin, 
neomycin, paromomycin, tobramycin, ciprofloxacin, clindamycin, rifampin, 

10 chloramphenicol, norfloxacin, or a derivative of these antibiotics. 

Similarly, another aspect of the present invention is a conjugate 
comprising an antibiotic or a detection reagent covalently conjugated to a protein 
including therein a carboxyl-terminal sorting signal as described above to produce a 
conjugate. In still another aspect of the present invention, a composition comprises 

15 the conjugate with a pharmaceutically acceptable carrier. 

Another aspect of the present invention is a substantially purified 
protein having at least about 50% match with best alignment with the amino acid 
sequences of at least one of the putative Bacillus peptidase (SEQ ID NO: 3), the 
aminopeptidase P of Lactococcus lactis (SEQ ID NO: 4), or the proline dipeptidase of 

20 Lactobacillus delbrueckii lactis (SEQ ID NO: 5) and having sortase-transamidase 
activity. Preferably, the match is at least about 60% in best alignment; more 
preferably, the match is at least about 70% in best alignment. 

Another aspect of the present invention is a substantially purified 
protein having sortase-transamidase activity and a hydrophobicity profile of at least 

25 one subunit of the protein that, determined as the mean absolute value of the 

hydrophobicity difference per residue, differs from the hydrophobicity profile of a 
putative Bacillus peptidase (SEQ ID NO: 3) by no more than about 2 units on the 
hydrophobicity scale. Preferably, the difference is not more than about 1 unit; most 
preferably, it is not more than about 0.5 units. 

30 Another aspect of the present invention is a substantially purified 

protein having sortase-transamidase activity and a hydrophobicity profile of at least 
one subunit of the protein that, determined as the mean absolute value of the 
hydrophobicity difference per residue, differs from the hydrophobicity profile of the 
sequence of SEQ ID NO: 31 by no more than about 2 units on the hydrophobicity 

35 scale. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features, aspects, and advantages of the present 
invention will become better understood with reference to the following description 
5 and accompanying drawings where: 

Figure 1 is a diagram showing the substantial homology of the amino 
acid sequence of the sortase-transamidase enzyme of Staphylococcus aureus to an 
open reading frame in the genome of Streptococcus pyogenes (SEQ ID NO: 2 & 34); 

Figure 2 is a diagram showing a greater homology of the amino acid 
10 sequence of the sortase-transamidase enzyme of Staphylococcus aureus to the 

carboxyl-terminal portion of the open reading frame in the genome of Streptococcus 
pyogenes (SEQ ID NO: 2 & 34); 

Figure 3 is the DNA sequence of the S. pyogenes open reading frame 
(SEQ ID NO: 33 & 34); 
15 Figure 4 (SEQ ID NO: 34) is the entire amino acid sequence of the 

protein translated from the entire S. pyogenes open reading frame; 

Figure 5 (SEQ ID NO: 3) is the amino acid sequence of a putative 
Bacillus peptidase in the GCVT-SPOIIIAA intergenic region; 

Figure 6 is the hydrophobicity profile of the protein whose amino acid 
20 sequence is shown in Figure 5 (SEQ ID NO: 3); 

Figure 7 (SEQ ID NO: 4) is the amino acid sequence of the 
aminopeptidase P of Lactococcus lactis; 

Figure 8 (SEQ ID NO: 5) is the amino acid sequence of the proline 
dipeptidase of Lactobacillus delbrueckii lactis; 
25 Figure 9 is a diagram of the activity of the sortase-transamidase 

enzyme of the present invention; 

Figure 10 (SEQ ID NOS: 28 & 29) is a partial DNA sequence of the 
gene for one of the subunits of the sortase-transamidase enzyme of £ aureus; 

Figure 1 1 (SEQ ID NO: 2) is the partial carboxyl-terminal amino acid 
30 sequence translated from the DNA sequence of Figure 10 (SEQ ID NOS: 28 & 29); 

Figure 12 (SEQ ID NOS: 30 & 3 1) is a partial DNA sequence of the 
gene for a second of the subunits of the sortase-transamidase enzyme of S. aureus; 
and 

Figure 13 is the hydrophobicity profile of the protein translated from 
35 the DNA sequence of Figure 12 (SEQ ID NOS: 30 & 31). 
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DEFINITIONS 

As used herein, the terms defined below have the following meanings 
unless otherwise indicated: 

"Nucleic Acid Sequence": the term "nucleic acid sequence" includes 
5 both DNA and RNA unless otherwise specified, and, unless otherwise specified, 

includes both double-stranded and single-stranded nucleic acids. Also included are 
hybrids such as DNA-RNA hybrids. In particular, a reference to DNA includes RNA 
that has either the equivalent base sequence except for the substitution of uracil and 
RNA for thymine in DNA, or has a complementary base sequence except for the 
10 substitution of uracil for thymine, complementarity being determined according to the 
Watson-Crick base pairing rules. Reference to nucleic acid sequences can also 
include modified bases as long as the modifications do not significantly interfere 
either with binding of a ligand such as a protein by the nucleic acid or with Watson- 
Crick base pairing. 

15 "Antibody": as used herein the term "antibody" includes both intact 

antibody molecules of the appropriate specificity, and antibody fragments (including 
Fab, F(ab'), Fv, and F(ab')2), as well as chemically modified intact antibody 
molecules and antibody fragments, including hybrid antibodies assembled by in vitro 
reassociation of subunits. Also included are single-chain antibody molecules 

20 generally denoted by the term sFv and humanized antibodies in which some or all of 
the originally non-human constant regions are replaced with constant regions 
originally derived from human antibody sequences. Both polyclonal and monoclonal 
antibodies are included unless otherwise specified. Additionally included are 
modified antibodies or antibodies conjugated to labels or other molecules that do not 

25 block or alter the binding capacity of the antibody. 

DESCRIPTION 

A substantially purified sortase-transamidase enzyme from Gram- 
positive bacteria, particularly Staphylococcus aureus. 
30 The properties of this enzyme make it a logical target for antibiotic action. This 
enzyme also catalyzes covalent crosslinkage of proteins to the peptidoglycan of 
Gram-positive bacteria. 

I. THE SORTASE-TRANSAMIDASE ENZYME 
35 One aspect of the invention is a substantially purified sortase- 

transamidase enzyme from a Gram-positive bacterium. As used herein, the term 
"substantially purified" means having a specific activity of at least tenfold greater than 
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the sortase-transamidase activity present in a crude extract, lysate, or other state from 
which proteins have not been removed and also in substantial isolation from proteins 
found in association with sortase-transamidase in the cell. 

One subunit of the enzyme has a molecular weight of about 41 ,000 
5 daltons. The enzyme catalyzes a reaction that covalently crosslinks the carboxyl- 
terminus of a protein having a sorting signal to the peptidogly can of the Gram- 
positive bacterium. The sorting signal has: (1) a motif of LPX3X4G therein; (2) a 
substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
1 0 substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif. In this sorting signal, X 3 can be any of the twenty naturally-occurring 
L-amino acids. X4 can be alanine, serine, or threonine. Preferably, X4 is threonine. 

The sortase-transamidase is believed to occur in all Gram-positive 
15 bacteria. In particular, the enzyme exists in Mycobacterium, Nocardia, Actinomyces, 
Staphylococcus, Streptococcus, Listeria, Enterococcus, and Pneumococcus. 
Specifically, the enzyme exists in the following species: Staphylococcus aureus, S. 
sobrinus, Enterococcus faecalis, Streptococcus pyogenes, and Listeria 
monocytogenes. 

20 Preferably the enzyme is isolated from Staphylococcus aureus. 

A. Amino Acid Sequence 

The sortase-transamidase of the present invention includes therein an 
amino acid sequence, in one subunit of the enzyme, of D-P-K-L-K-E-I— Y-Q-I- 
V-L-E-S-Q-M-K-A-I-N-E-I-R-P-G-M-T-G-A-E-A-D-A-I-S-R-N-Y- 

25 L-E-S-K-G-Y-G-K-E-F-G-H-S-L-G-H-G-I-G-L-E-I-H-E-G-P-M-L- 
A-R-P-I-Q-D-K-L-Q-V-N-N-C-V-T-V-E-P-G-V-Y-I-E-G-L-G-G-I- 
R-I-E-D-D-I-L-I-T-E-N-G-C-Q-V-F-T-K-C-T-K-D-L-I-V-L-T (SEQ 
ID NO:2). This sequence is at the carboxyl-terminal end of the subunit of the 
enzyme. 

30 The sortase-transamidase of the present invention also includes therein 

an amino acid sequence, in a second subunit of the enzyme, of M-V-K-V— T— D-Y- 
S-N-S-K-L-G-K-E-I-A-P-E-V-L-S-V-I-A-S-I-A-T-S-E-V-E-G-I-T- 
G-H-F-A-E-L-K-E-T-N-L-E-K-V-S-R-K-N-L-S-R-D-L-K-I-E-S-K- 
E-G-I-Y-I-D-V-Y-C-A-L-K-H-G-V-N-I-S-K-T-A-N-K-I-Q-T-S-I-F- 

35 N-S-I-S-N-M-T-A-I-E-P-K-Q-I-N-I-H-I-T-Q-I-V-I-E-K (SEQ ID NO: 
31). 
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Also within the scope of the present invention are substantially purified 
protein molecules that are mutants of the sequence of SEQ ID NO:2 or SEQ ID NO: 
3 1 that preserve the sortase-transamidase activity. In particular, the conservative 
amino acid substitutions can be any of the following: (1) any of isoleucine, leucine, 
5 and valine for any other of these amino acids; (2) aspartic acid for glutamic acid and 
vice versa; (3) glutamine for asparagine and vice versa; and (4) serine for threonine 
and vice versa. 

Other substitutions can also be considered conservative, depending 
upon the environment of the particular amino acid. For example, glycine (G) and 

10 alanine (A) can frequently be interchangeable, as can be alanine and valine (V). 

Methionine (M), which is relatively hydrophobic, can frequently be interchanged with 
leucine and isoleucine, and sometimes with valine. Lysine (K) and arginine (R) are 
frequently interchangeable in locations in which the significant feature of the amino 
acid residue is its charge and the different pK's of these two amino acid residues or 

15 their different sizes are not significant. Still other changes can be considered 

"conservative" in particular environments. For example, if an amino acid on the 
surface of a protein is not involved in a hydrogen bond or salt bridge interaction with 
another molecule, such as another protein subunit or a ligand bound by the protein, 
negatively charged amino acids such as glutamic acid and aspartic acid can be 

20 substituted for by positively charged amino acids such as lysine or arginine and vice 
versa. Histidine (H), which is more weakly basic than arginine or lysine, and is 
partially charged at neutral pH, can sometimes be substituted for these more basic 
amino acids. Additionally, the amides glutamine (Q) and asparagine (N) can 
sometimes be substituted for their carboxylic acid homologues, glutamic acid and 

25 aspartic acid. 

The sortase-transamidase from Staphylococcus aureus has substantial 
homology the amino acid sequence of the first subunit, that of SEQ ID NO: 2, with an 
open reading frame in the genome of Streptococcus pyogenes, particularly in the 
amino-termmal region. There is about a 22% match with best alignment over the 

30 entire sequenced region of the S. pyogenes open reading frame, and about a 47% 

match with best alignment over the carboxyl-terminal region of the S. pyogenes open 
reading frame. These matches are shown in Figures 1-2. The DNA sequence of the 
entire S. pyogenes open reading frame is shown in Figure 3 (SEQ ID NO: 33 & 34). 
The protein translated from the entire S. pyogenes open reading frame has a molecular 

35 weight of about 40,851 .43 daltons; its sequence is shown in Figure 4 (SEQ ID NO: 
34). Therefore, another aspect of the present invention is a substantially purified 
protein molecule that has at least one subunit of about 40,000 to about 41,000 daltons 
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in molecular weight, the subunit having at least a 20% match with best alignment with 
the S. pyogenes open reading frame of Figure 2 and that has sortase-transamidase 
activity. Preferably, the subunit has at least a 30% match with best alignment; more 
preferably, at least 50% match with best alignment. 
5 As shown below in the Example, the first 364 bases of a nucleic acid 

segment that complements a temperature-sensitive mutation in the S. aureus sortase- 
transamidase, designated the SM-3 1 7 complementing gene insert, has been identified 
as encoding a protein sequence that is a homologue of a putative Bacillus peptidase in 
the GCVT-SPOIIIAA intergenic region (GenBank Accession No. 1731048; Y. 

10 Kobayashi et aL). The sequence of this putative peptidase is shown in Figure 5 (SEQ 
ID NO:3) and its hydrophobicity profile is shown in Figure 6. The hydrophobicity is 
calculated according to the method of J. Kyte & R.F. Doolittle, "A Simple Method for 
Displaying the Hydropathic Character of a Protein," J. Mol. Biol. 157: 105-132 
(1982). As used herein, the term "hydrophobicity" is the hydrophobicity as calculated 

15 in Kite & Doolittle, supra . 

To a lesser degree of homology, the protein sequence encoded by this 
complementing gene insert is homologous to aminopeptidase P of Lactococcus lactis 
(GenBank Accession No. 1915907; J. Matos). The amino acid sequence of this 
aminopeptidase is shown in Figure 7 (SEQ ID NO: 4). To a still lesser degree of 

20 homology, the protein sequence encoded by this complementing gene insert is 

homologous to the proline dipeptidase of Lactobacillus delbrueckii lactis (GenBank 
Accession No. 1 172066; K. Stucky et aL, "Cloning and DNA Sequence Analysis of 
pepQ, a Prolidase Gene from Lactobacillus delbrueckii subsp. lactis and Partial 
Characterization of Its Product," Mol. Gen. Genet. 247: 494-500 (1995)). The amino 

25 acid sequence of this proline dipeptidase is shown in Figure 8 (SEQ ID NO:5). 

Because of the relatedness of these proteins, another aspect of the 
present invention is a substantially purified protein having at least one subunit with at 
least about 50% match with best alignment with the amino acid sequences of at least 
one of the putative Bacillus peptidase (SEQ ID NO: 3), the aminopeptidase P of 

30 Lactococcus lactis (SEQ ID NO: 4), or the proline dipeptidase of Lactobacillus 
delbrueckii lactis (SEQ ID NO: 5) and having sortase-transamidase activity. 
Preferably, the at least one subunit of the protein has at least about 60% match with 
best alignment with at least one of these sequence; more preferably, the at least one 
subunit of the protein has at least about 70% match with best alignment with at least 

35 one of these sequences. 

Because the hydrophobicity of a protein is a sensitive measure of 
protein structure, another aspect of the invention is a substantially purified protein 
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having sortase-transamidase activity and a hydrophobicity profile for at least one 
subunit of the protein that, determined as the mean absolute value of the 
hydrophobicity difference per residue, differs from the hydrophobicity profile of the 
putative Bacillus peptidase by no more than about 2 units on the hydrophobicity scale 
5 of Kyte & Doolittle, supra . Preferably, the difference is no greater than about 1 unit; 
more preferably, the difference is no greater than about 0.5 units. 

The sortase-transamidase is a cysteine protease. 

B. Activity of the Sortase-Transamidase 

10 The activity of the sortase-transamidase enzyme of the present 

invention is shown, in general, in Figure 9. The enzyme first cleaves a polypeptide 
having a sorting signal within the LPX3X4G motif Cleavage occurs after residue X4, 
normally a threonine; as indicated above, this residue can also be a serine or alanine 
residue. This residue forms a covalent intermediate with the sortase. The next step is 

15 the transamidation reaction that transfers the cleaved carboxyl terminus of the protein 
to be sorted to the -NH 2 of the pentaglycine crossbridge within the peptidoglycan 
precursor. The peptidoglycan precursor is then incorporated into the cell wall by a 
transglycosylase reaction with the release of undecaprenyl phosphate. The mature 
anchored polypeptide chains are thus linked to the pentaglycine cross bridge in the cell 

20 wall which is tethered to the s-amino side chain of an unsubstituted cell wall 
tetrapeptide. A carboxypeptidase may cleave a D-Ala-D-Ala bond of the 
pentapeptide structure to yield the final branched anchor peptide in the staphylococcal 
cell wall. 

The sorting signal has: (1) a motif of LPX3X4G therein; (2) a 
25 substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region. 

In the motif, X 3 can be any of the 20 naturally-occurring L-amino 
acids. X4 can be any of threonine, serine, or alanine. Preferably, X4 is threonine (O. 
Schneewind et al., "Cell Wall Sorting Signals in Surface Proteins of Gram-Positive 
30 Bacteria," EMBO J. 12:4803-481 1 (1993)). 

Preferably, the substantially hydrophobic domain carboxyl to the motif 
includes no more than about 7 charged residues or residues with polar side chains. 
For the purposes of this specification, these residues include the following: aspartic 
acid, glutamic acid, lysine, and arginine as charged residues, and serine, threonine, 
35 glutamine, and asparagine as polar but uncharged residues. Preferably, the sequence 
includes no more than three charged residues. 



WO 99/09145 



PCT/US98/16229 



-15- 

Representative sequences suitable for sorting signals for use with the 
sortase-transamidase of the present invention include, but are not limited to the * 
following: E-E-N-P-F-I-G-T-T-V-F-G-G-L-S-L-A-L-G-A-A-L-L-A-G 
(SEQ ID NO: 6), the hydrophobic domain of the staphylococcal proteinase (SPA) 
5 sorting signal from Staphylococcus aureus; (2) G-E-E-S-T-N-K-G-M-L-F-G- 
G-L-F-S-I-L-G-L-A-L-L (SEQ ID NO:7), the SNBP signal of S aureus; (3) D- 
S-S-N-A-Y-L-P-L-L-G-L-V-S-L-T-A-G-F-S-L-L-G-L (SEQ ID NO: 8), 
the SPAA signal of S. sobrinus, (4) E-K-Q-N-VHL-L-T-V-V-G-S-L-A-A-M- 
L-G-L-A-G-L-G-F (SEQ ID NO:9), the PRGB signal of Enterococcus faecalis, 

10 (5) S-I-G-T-Y-L-F-K-I-G-S-A-A-M-I-G-A-I-G-I-Y-I-V (SEQ ID NO: 1 0), 
the TEE signal of Streptococcus pyogenes, and (6) D-S-D-N-A-L-Y-L-L— L-G- 
L-L-A-V-G-T-A-M-A-L-T (SEQ ID NO: 1 1), the INLA signal of Listeria 
monocytogenes. Other hydrophobic domains can be used as part of the sorting signal. 

The third portion of the sorting signal is a charged tail region with at 

15 least two positively charged residues carboxyl to the substantially hydrophobic 

domain. At least one of the two positively charged residues is arginine. The charged 
tail can also contain other charged amino acids, such as lysine. Preferably, the 
charged tail region includes two or more arginine residues. The two positively 
charged residues are located at residues 31-33 from the motif. Preferably, the two 

20 arginine residues are either in succession or are separated by no more than one 

intervening amino acid. Preferably, the charged tail is at least five amino acids long, 
although four is possible. Among the charged tails that can be used are the following: 
(1) R-R-R-E-L (SEQ ID NO: 12), from the SPA signal of £ aureus; (2) R-R-N-K- 
K-N-H-K-A (SEQ ID NO: 13), from the SNBP signal of S. aureus; (3) R-R-K-Q- 

25 D (SEQ ID NO: 14), from the SPAA signal of S sobrinus; (4) K-R-R-K-E-T-K 
(SEQ ID NO: 15), from the PRGB signal of E. faecal is; (5) K-R-R-K-A (SEQ ID 
NO: 16), from the TEE signal of S. pyogenes; (6), K-R-R-H-V-A-K-H (SEQ ID 
NO: 17), from the FIM sorting signal of Actinomyces viscosus, and (7) K-R-R-K-S 
(SEQ ID NO: 1 8), from the BAC sorting signal of Streptococcus aglactiae; (8) K-R- 

30 K-E-E-N (SEQ ID NO: 1 9), from the EMM signal of Streptococcus pyogenes. 

Also usable as the charged tail portion of the sorting signal are the 
following sequences produced by mutagenesis from the SPA signal of S. aureus. 
These include R-R-R-E-S (SEQ ID NO: 20), R-R-R-S-L (SEQ ID NO: 21), R- 
R-S-E-L (SEQ ID NO: 22), R-S-R-E-L (SEQ ID NO: 23) and S-R-R-E-L (SEQ 

35 ID NO: 24). Other charged tails that are usable as part of the sorting signal can be 
derived from a poly serine tail, itself inactive, by replacement of one or more of the 
serine residues with the basic amino acid arginine. These include R-R-S-S-S (SEQ 
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ID NO: 25), R-S-R-S-S (SEQ ID NO:26), and S-R-R-S-S (SEQ ID NO:27). 
Other sorting signals can also be used. 



II. THE GENE ENCODING THE S ORT A SE-TRAN S AMID A S E ENZYME 
5 A. Isolation of the Sortase-Transamidase Enzvme Gene 

The gene for the sortase-transamidase enzyme in Staphylococcus 
aureus has been isolated. The isolation process is described in detail in the Example 
below; in general, this process comprises: (1) the generation of temperature-sensitive 
mutants through chemical mutagenesis, such as with the DNA modifying agent N- 

10 methyl-N-nitro-N-nitrosoguanidine; (2) Screening for temperature-sensitive 

mutants; (3) screening the temperature-sensitive mutants for a block in protein sorting 
by the use of a construct harboring the staphylococcal enterotoxin B (SEB) gene fused 
to the cell wall sorting signal of staphylococcal Protein A (SPA), to locate mutants 
that accumulate a precursor molecule formed by cleavage of an amino-terminal signal 

15 peptide but that is not then processed by cleavage of the carboxyl-terminal sorting 

signal; (4) generation of a S. aureus chromosomal library and complementation of the 
temperature-sensitive sorting defect; and (5) sequencing and characterization of the S. 
aureus complementing determinants. 



20 B. Sequence of the Sortase-Transamidase Gene 

The above procedure yielded a partial sequence for one of the subunits 
of the sortase-transamidase including the carboxyl-terminal portion of the gene for 
the first subunit. This sequence is 

GATCCTAAACTGAAAGAAATATATCAAATAGTACTTGAATCTCAAATGAA 
25 AGCAATTAATGAGATTAGACCTGGCATGACTGGTGCAGAAGCTGATGCCA 
TTTCAAGAAACTATTTAGAGTCAAAAGGGTATGGAAAAGAATTTGGACAT 
TCACTAGGACATGGTATTGGTTTAGAAATCCATGAAGGGCCAATGCTGGC 
TCGTACGATACAAGATAAACTTCAAGTTAACAACTGTGTTACAGTAGAAC 
CTGGTGTTTATAGAAGGTTTGGGCGGTATAAGAATAGAAGATGATATTTT 
30 AATTACAGAAAATGGTTGTCAAGTCTTTACTAAATGCACAAAAGACCTTA 
TAGTTTTAACATAA (SEQ ID NO:28 & 29). The last three nucleotides, TAA, of 
this sequence are the stop codon. 

The above procedure further yielded a sequence for a second subunit of 
ATGGTCAAAGTAACTGATTATTCAAATTCAAAATTAGGTAAAGTAGAAAT 
35 AGCGCCAGAAGTGCTATCTGTTATTGCAAGTATAGCTACTTCGGAAGTCG 
AAGGCATCACTGGCCATTTTGCTGAATTAAAAGAAACAAATTTAGAAAAA 
GTTAGTCGTAAAAATTTAAGCCGTGATTTAAAAATCGAGAGTAAAGAAGA 
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TGGCATATATATAGATGTATATTGTGCATTAAAACATGGTAATATTTCAAA 
AACTGCAAACAAAATTCAAACGTCAATTTTTAATTCAATTTCTAATATGAC 
AGCGATAGAACCTAAGCAAATTAATATTCACATTACACAAATCGTTATTG 
AAAAGTAA (SEQ ID NO: 30 & 31) The last three nucleotides of this sequence, 
5 TAA, are the stop codon. 

Accordingly, within the scope of the present invention is a nucleic acid 
sequence encoding a substantially purified sortase-transamidase enzyme from a 
Gram-positive bacterium. The enzyme encoded has at least one subunit with a 
molecular weight of about 41,000 daltons and catalyzes a reaction that covalently 
10 cross-links the carboxyl-terminus of a protein having the sorting signal described 
above to the peptidoglycan of a gram-positive bacterium. The nucleic acid sequence 
includes therein the sequence of SEQ ID NO: 28 or a sequence complementary to 
SEQ ID NO: 28, or the sequence of SEQ ID NO: 30 or a sequence complementary to 
SEQ ID NO: 30. 

15 Also included within the present invention is a nucleic acid sequence 

encoding a substantially purified sortase-transamidase enzyme from a Gram-positive 
bacterium with at least one subunit with a molecular weight of about 41,000 daltons, 
where the enzyme catalyzes the cross-linking reaction where the nucleic acid 
sequence hybridizes with at least one of: (1) the sequence of SEQ ID NO: 28; (2) a 

20 sequence complementary to SEQ ID NO: 28; (3) the sequence of SEQ ID NO: 30; or 
(4) a sequence complementary to SEQ ID NO: 30 with no greater than about a 15% 
mismatch under stringent conditions. Preferably, the degree of mismatch is no greater 
than about 5%; most preferably the mismatch is no greater than about 2%. 

Also within the present invention is a nucleic acid sequence encoding a 

25 substantially purified sortase-transamidase enzyme from a Gram-positive bacterium 
where the enzyme has at least one subunit with a molecular weight of about 41,000 
daltons and catalyzes the cross-linking reaction described above involving the sorting 
signal, where the enzyme includes therein an amino acid sequence selected from the 
group consisting of: (1) D-P-K-L-K-E-I-Y-Q-I-V-L-E-S-Q-M-K-A-I-N-E- 

30 I-R-P-D-M-T-G-A-E-A-D-A-I-S-R-N-Y-L-E-S-K-G-Y-G-K-E-F-G- 
H-S-L-G-H-G-I-G-L-E-I-H-E-G-P-M-L-A-R-T-I-Q-D-K-L-Q-V-N- 
N-C-V-T-V-E-P-G-V-Y-I-E-G-L-G-I-R-I-E-D-D-I-L-I-T-E-N-G-C- 
Q-V-F-T-K-C-T-K-D-L-I-V-L-T (SEQ ID NO:2); (2) M-V-K-V-T-D-Y- 

S-N-S-K-L-G-K-E-I-A-P-E-V-L-S-V-I-A-S-I-A-T-S-E-V-E-G-I-T- 
35 G-H-F-A-E-L-K-E-T-N-L-E-K-V-S-R-K-N-L-S-R-D-L-K-I-E-S-K- 
E-G-I-Y-I-D-V-Y-C-A-L-K-H-G-V-N-I-S-K-T-A-N-K-I-Q-T-S-I-F- 
N-S-I-S-N-M-T-A-I-E-P-K-Q-I-N-I-H-I-T-Q-I-V-I-E-K (SEQ ID NO: 
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31); and (3) sequences incorporating one or more conservative amino acid 
substitutions in SEQ ID NO: 2 or SEQ ID NO: 3 1 wherein the conservative amino acid 
substitutions are any of the following: (1) any of isoleucine, leucine and valine for any 
other of these amino acids; (2) aspartic acid for glutamic acid and vice versa; (3) 
5 glutamine for asparagine and vice versa; and (4) serine for threonine and vice versa. 
Alternative nucleic acid sequences can be determined using the standard genetic code; 
the alternative codons are readily determinable for each amino acid in this sequence. 

Construction of nucleic acid sequences according to the present 
invention can be accomplished by techniques well known in the art, including solid- 
10 phase nucleotide synthesis, the polymerase chain reaction (PCR) technique, reverse 
transcription of DNA from RNA, the use of DNA polymerases and ligases, and other 
techniques. If an amino acid sequence is known, the corresponding nucleic acid 
sequence can be constructed according to the genetic code. 

15 C. Vectors and Host Cells Transformed with Vectors 

Another aspect of the invention is a vector comprising a nucleic acid 
sequence according to the present invention operatively linked to at least one control 
sequence that controls the expression or regulation of the nucleic acid sequence. Such 
control sequences are well known in the art and include operators, promoters, 

20 enhancers, promoter-proximal elements and replication origins. The techniques of 
vector construction, including cloning, ligation, gap-filling, the use of the polymerase 
chain reaction (PCR) procedure, solid-state oligonucleotide synthesis, and other 
techniques, are all well known in the art and need not be described further here. 

Another aspect of the present invention is a host cell transfected with a 

25 vector according to the present invention. Among the host cells that can be used are 
gram-positive bacteria such as Staphylococcus aureus. 

Transfection, also known as transformation, is done using standard 
techniques appropriate to the host cell used, particularly Staphylococcus aureus. Such 
techniques are described, for example, in R.P. Novick, "Genetic Systems in 

30 Staphylococci," Meth. Enzvmol. 204: 587-636 (1991 ), as well as in O. Schneewind et 
al, "Sorting of Protein A to the Staphylococcal Cell Wall," Cell 70: 267-281 (1992). 

III. S ORT A S E-TRAN S AMI D A SE AS A TARGET FOR ANTIBIOTIC ACTION 
A. A Site for Antibiotic Action 
35 The reaction carried out by the sortase-transamidase of the present 

invention presents a possible target for a new class of antibiotics to combat medically 
relevant infections caused by numerous gram-positive organisms. Because this is a 
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novel site of antibiotic action, these antibiotics have the advantage that resistance by 
the bacterium has not had a chance to develop. 

Such antibiotics can include compounds with structures that mimic the 
cleavage site, such as compounds with a structure similar to methyl 
5 methanethiosulfonate or, more generally, alkyl methanethiosulfonates. The sortase- 
transamidase of the present invention is believed to be a cysteine protease. Other 
antibiotics that may inhibit the activity of the sortase-transamidase in the present 
invention include inhibitors that would be specific for cysteine-modification in a (3- 
lactam framework. These inhibitors would have active moieties that would form 

10 mixed disulfides with the cysteine sulfhydryl. These active moieties could be 

derivatives of methanethiosulfonate, such as methanethiosulfonate ethylammonium, 
methanethiosulfonate ethyltrimethylammonium, or methanethiosulfonate 
ethylsulfonate (J.A. Javitch et al., "Mapping the Binding Site Crevice of the 
Dopamine D2 Receptor by the Substituted-Cysteine Accessibility Method," Neuron . 

15 14: 825-831 (1995); M.H. Akabas & A. Karlin, "Identification of Acetylcholine 
Receptor Channel-Lining Residues in the Ml Segment of the a-Subunit," 
Biochemistry 34: 12496-12500 (1995)). Similar reagents, such as alkyl 
alkanethiosulfonates, i.e., methyl methanethiosulfonate, or alkoxycarbonylalkyl 
disulfides, have been described (D.J. Smith et al, "Simple Alkanethiol Groups for 

20 Temporary Blocking of Sulfhydryl Groups of Enzymes/' Biochemistry 14: 766-771 
(1975); W.N. Valentine & D.E. Paglia, "Effect of Chemical Modification of 
Sulfhydryl Groups of Human Erythrocyte Enzymes." Am. J. Hematol. 11:11 1-124 
(1981)). Other useful inhibitors involve derivatives of 2-trifluoroacetylaminobenzene 
sulfonyl fluoride (J.C. Powers, "Proteolytic Enzymes and Their Active-Site-Specific 

25 Inhibitors: Role in the Treatment of Disease," in Modification of Proteins ), in a (3- 
lactam framework, peptidyl aldehydes and nitriles (E. Dufour et al., "Peptide 
Aldehydes and Nitriles as Transition State Analog Inhibitors of Cysteine Proteases," 
Biochemistry 34: 9136-9143 (1995); J. O. Westerik & R. Wolfenden, "Aldehydes as 
Inhibitors of Papain," J. Biol. Chem. 247: 8195-8197 (1972)), peptidyl diazomethyl 

30 ketones (L. Bjorck et al., "Bacterial Growth Blocked by a Synthetic Peptide Based on 
the Structure of a Human Proteinase Inhibitor," Nature 337: 385-386 (1989)), 
peptidyl phosphonamidates (P.A. Bartlett & C.K. Marlowe, "Phosphonamidates as 
Transition-State Analogue Inhibitors of Thermolysin," Biochemistry 22: 4618-4624 
(1983)), phosphonate monoesters such as derivatives or analogues of m- 

35 carboxyphenyl phenylacetamidomethylphosphonate (R.F. Pratt, "Inhibition of a Class 
C p-Lactamase by a Specific Phosphonate Monoester," Science 246: 917-919 
(1989)), maleimides and their derivatives, including derivatives of such bifunctional 
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maleimides as o-phenylenebismaleimide, p-phenylenebismaleimide, m- 
phenylenebismaleimide, 2,3-naphthalenebismaleimide, 1,5— 
naphthalenebismaleimide, and azophenylbismaleimide, as well as monofunctional 
maleimides and their derivatives (J.V. Moroney et al. 5 "The Distance Between Thiol 
5 Groups in the y Subunit of Coupling Factor 1 Influences the Proton Permeability of 
Thylakoid Membranes." J, Bioenerget Biomembr. 14: 347-359 (1982)), peptidyl 
halomethyl ketones (chloromethyl or fluoromethyl ketones), peptidyl sulfonium salts, 
peptidyl acyloxymethyl ketones, derivatives and analogues of epoxides, such as E-64 
(N-[N-(L-trans-carboxyoxiran-2-carbonyl)-L-leucylagmatine), E-64c (a 

10 derivative of E— 64 in which the agmatine moiety is replaced by an isoamylamine 
moiety), E-64c ethyl ester, Ep-459 (an analogue of E-64 in which the agmatine 
moiety is replaced by a 1 ,4-diaminopropyl moiety), Ep-479 (an analogue of E-64 in 
which the agmatine moiety is replaced by a 1,7-diheptylamino moiety), Ep-460 (a 
derivative of Ep-459 in which the terminal amino group is substituted with a Z 

15 (benzyloxycarbonyl) group), Ep-174 (a derivative of E-64 in which the agmatine 
moiety is removed, so that the molecule has a free carboxyl residue from the leucine 
moiety), Ep-475 (an analogue of E-64 in which the agmatine moiety is replaced with 
a NH2-(CH 2 )2-CH-(CH 3 )2 moiety), or Ep-420 (a derivative of E-64 in which the 
hydroxyl group is benzoylated, forming an ester, and the leucylagmatine moiety is 

20 replaced with isoleucyl-O-methyltyrosine), or peptidyl O-acyl hydroxamates (E 
Shaw, "Cysteinyl Proteases and Their Selective Inactivation), pp 271-347). Other 
inhibitors are known in the art. 

B. Screening Methods 
25 Another aspect of the present invention is a method for screening a 

compound for anti-sortase-transamidase activity. This is an important aspect of the 
present invention, because it provides a method for screening for compounds that 
disrupt the sorting process and thus have potential antibiotic activity against Gram- 
positive bacteria. 

30 In general, this method comprises the steps of: (1) providing an active 

fraction of sortase-transamidase enzyme; (2) performing an assay for sortase- 
transamidase activity in the presence and in the absence of the compound being 
screened; and (3) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound. 

35 The active fraction of sortase-transamidase enzyme can be a 

substantially purified sortase-transamidase enzyme preparation according to the 
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present invention, but can be a less purified preparation, such as a partially purified 
particulate preparation as described below. 

The enzymatic activity can be measured by the cleavage of a suitable 
substrate, such as the construct having the Staphylococcal Enterotoxin B (SEB) gene 
5 fused to the cell wall sorting signal of Staphylococcal Protein A (SPA). The cleavage 
can be determined by monitoring the molecular weight of the products by sodium 
dodecyl sulfate-polyacrylamide gel electrophoresis or by other methods. 

One particularly preferred assay for sortase-transamidase activity is the 

following: 

10 Staphylococcal soluble RNA (sRNA) is prepared from S. aureus by a 

modification of the technique of Zubay (G. Zubay, J. Mol. Biol. 4: 347-356 (1962)). 
An overnight culture of S. aureus is diluted 1 : 10 in TSB and incubated at 37°C for 3 
hr. The cells are harvested by centrifugation at 6000 rpm for 1 5 min. 

For every gram of wet cell pellets, 2 ml of 0.01 M magnesium acetate, 

15 0.001 M Tris, pH 7.5 is used to suspend the pellets. The cell pellets are beaten by 

glass bead beater for 45 minutes in 5 minute intervals. The suspension is centrifiiged 
twice at 2500 rpm for 5 minutes to remove the glass beads, then 0.5 ml phenol is 
added to the suspension. The suspension is vigorously shaken for 90 minutes at 4°C, 
and then centrifuged at 18,000 x g for 15 minutes. The nucleic acids in the top layer 

20 are precipitated by addition of 0.1 volume of 20% potassium acetate and 2 volumes of 
ethanol, then stored at 4°C for at least 36 hours. The precipitate is obtained by 
centrifugation at 5,000 x g for 5 minutes. Cold NaCl (1 ml) is added to the precipitate 
and stirred at 4°C for 1 hour. The suspension is centrifuged at 15,000 x g for 30 
minutes. The sediments are washed with 0.5 ml of cold 1 M NaCl. The supernatants 

25 are combined and 2 volumes of ethanol is added to precipitate the tRNA. The 

precipitate is suspended in 0.1 ml of 0.2 M glycine, pH 10.3 and incubated for 3 hr at 
37°C. This suspension is then made 0.4 M in NaCl and the RNA is precipitated by 
addition of 2 volumes of ethanol. The precipitate is dissolved in 0.7 ml of 0.3 M 
sodium acetate, pH 7.0. To this is slowly added 0.5 volume of isopropyl alcohol, with 

30 stirring. The precipitate is removed by centrifugation at 8,000 x g for 5 min. This 

precipitate is redissolved in 0.35 ml of 0.3 M sodium acetate, pH 7.0. To this is added 
0.5 volume of isopropyl alcohol, using the same procedure as above. The precipitate 
is also removed by centrifugation. The combined supernatants from the two 
centrifugations are treated further with 0.37 ml of isopropyl alcohol. The resulting 

35 precipitate is dissolved in 75 jlxI of water and dialyzed against water overnight at 4°C. 
This sRNA is used in the sortase-transamidase assay. 
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Particulate sortase-transamidase enzyme is prepared for use in the 
assay by a modification of the procedure of Chatterjee & Park (A.N. Chatterjee &J.T. 
Park, Proc. Natl. Acad. Sci. USA 51 : 9-16 (1964)). An overnight culture of S. aureus 
OS2 is diluted 1 :50 in TSB and incubated at 37°C for 3 hr. Cells are harvested by 
5 centrifugation at 6000 rpm for 1 5 minutes, and washed twice with ice-cold water. 
The cells are disrupted by shaking 7 ml of 1 3% suspension of cells in 0.05 M Tris- 
HC1 buffer, pH 7.5, 0.1 mM MgCl 2 , and 1 mM 2-mercaptoethanol with an equal 
volume of glass beads for 10-15 minutes in a beater. The glass beads are removed by 
centrifugation at 2000 rpm for 5 minutes. The crude extract is then centrifuged at 
10 15,000 x g for 5 minutes. The supernatant is centrifuged again at 100,000 x g for 30 
minutes. The light yellow translucent pellet is resuspended in 2 to 4 ml of 0.02 M 
Tris-HCl buffer, pH 7.5, containing 0.1 mM MgCl 2 and 1 mM 2-mercaptoethanol. 
This suspension represents the crude particulate enzyme and is used in the reaction 
mixture below. 

1 5 The supernatant from centrifugation at 1 00,000 x g is passed through 

gel filtration using a Sephadex® G-25 agarose column (Pharmacia) to remove 
endogenous substrates. This supernatant is also used in the reaction mixture. 

The complete reaction mixture contains in a final volume of 30 \xl (M. 
Matsuhashi et al., Proc. Natl. Acad. Sci. USA 54: 587-594 (1965)): 3 pmol of Tris- 

20 HC1, pH 7.8; 0.1 ^imol of MgCl 2 ; 1.3 ^imol of KC1; 2.7 nmol of [ 3 H] glycine (200 

j^Ci/jimol); 2 nmol of UDP-M-pentapeptide; 5 nmol of UDP-N-acetylglucosamine; 
0.2 nmol of ATP; 0.05 junol of potassium phosphoenolpyruvate; 2.05 ^ig of 
chloramphenicol; 5 |ig of pyruvate kinase; 0.025 jimol of 2-mercaptoethanol; 50 ng 
of staphylococcal sRNA prepared as above; 4 jag (as protein) of supernatant as 

25 prepared above; 271 [ig of particulate enzyme prepared as above; and 8 nmol of a 
synthesized soluble peptide (HHHHHHAQALEPTGEENPF) (SEQ ID NO: 32) as a 
substrate. 

The mixture is incubated at 20°C for 60 minutes. The mixture is then 
heated at 100°C for 1 minute. The mixture is diluted to 1 ml and precipitated with 50 

30 jal nickel resin, and washed with wash buffer (1% Triton X-100, 0.1% sodium 
dodecyl sulfate, 50 mM Tris, pH 7.5). The nickel resin beads are counted in a 
scintillation counter to determine 3 H bound to the beads. 

The effectiveness of the compound being screened to inhibit the 
activity of the sortase-transamidase enzyme can be determined by adding it to the 

35 assay mixture in a predetermined concentration and determining the resulting degree 
of inhibition of enzyme activity that results. Typically, a dose-response curve is 
generated using a range of concentrations of the compound being screened. 
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The particulate enzyme preparation of sortase— transamidase employed 
in this protocol can be replaced with any other sortase-transamidase preparation, - 
purified or crude, staphylococcal, recombinant, or from any other source from any 
other Gram-positive bacterium as described above. 
5 The soluble peptide is captured in this embodiment by its affinity for 

nickel resin as a result of the six histidine residues. More than six histidine residues 
can be used in the peptide. As an alternative, the soluble peptide can be captured by 
an affinity resulting from other interactions, such as streptavidin-biotin, glutathione 
S-transferase-glutathione, maltose binding protein-amylose, and the like, by 

10 replacing the six histidine residues with the amino acid sequence that constitutes the 
binding site in the peptide and employing the appropriate solid phase affinity resin 
containing the binding partner. Suitable peptides can be prepared by solid phase 
peptide synthesis using techniques well known in the art, such as those described in 
M. Bodanszky, "Peptide Chemistry: A Practical Textbook" (2d ed., Springer- Verlag, 

15 Berlin, 1993). For example, if the glutathione S-transferase-glutathione interaction is 
used, the active site of glutathione S-transferase (DJB. Smith & K.S. Johnson, 
"Single-Step Purification of Polypeptides Expressed in Escherichia coli as Fusions 
with Glutathione S-Transferase," Gene 67: 3 1-40 (1 988)) can be substituted for the 
six histidine residues, and glutathione can be bound to the solid support. 

20 

IV. USE OF SORTASE-TRANSAMIDASE FOR PROTEIN AND PEPTIDE 
DISPLAY 

A. Methods for Protein and Peptide Display 

The sortase-transamidase enzyme of the present invention can also be 
25 used in a method of displaying a polypeptide on the surface of a gram-positive 
bacterium. 

In general, a first embodiment of this method comprises the steps of: 
(1) expressing a polypeptide having a sorting signal at its carboxyl-terminal end as 
described above; (2) forming a reaction mixture including: (i) the expressed 

30 polypeptide; (ii) a substantially purified sortase-transamidase enzyme; and (iii) a 

Gram-positive bacterium having a peptidoglycan to which the sortase-transamidase 
can link the polypeptide; and (3) allowing the sortase-transamidase to catalyze a 
reaction that cleaves the polypeptide within the LPX3X4G motif of the sorting signal 
and covalently cross-links the amino-terminal portion of the cleaved polypeptide to 

35 the peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 
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In this method, the polypeptide having the sorting signal at its 
carboxy-terminal end need not be expressed in a Gram-positive bacterium; it can be 
expressed in another bacterial system such as Escherichia coli or Salmonella 
typhimurium, or in a eukaryotic expression system. 
5 The other method for protein targeting and display relies on direct 

expression of the chimeric protein in a Gram-positive bacterium and the action of the 
sortase-transamidase on the expressed protein. In general, such a method comprises 
the steps of: (1) cloning a nucleic acid segment encoding a chimeric protein into a 
Gram-positive bacterium to generate a cloned chimeric protein including therein a 

10 carboxyl-terminal sorting signal as described above, the chimeric protein including 
the polypeptide to be displayed; (2) growing the bacterium into which the nucleic acid 
segment has been cloned to express the cloned chimeric protein to generate a chimeric 
protein including therein a carboxyl-terminal sorting signal; and (3) covalent binding 
of the chimeric protein to the cell wall by the enzymatic action of the sortase- 

1 5 transamidase involving cleavage of the chimeric protein within the LPX3X4G motif so 
that the protein is displayed on the surface of the gram-positive bacterium in such a 
way that the protein is accessible to a ligand. 

Typically, the Gram-positive bacterium is a species of Staphylococcus. 
A particularly preferred species of Staphylococcus is Staphylococcus aureus. 

20 However, other Gram-positive bacteria such as Streptococcus 

pyogenes, other Streptococcus species, and Gram-positive bacteria of other genera 
can also be used. 

Cloning the nucleic acid segment encoding the chimeric protein into 
the Gram-positive bacterium is performed by standard methods. In general, such 

25 cloning involves: (1) isolation of a nucleic acid segment encoding the protein to be 
sorted and covalently linked to the cell wall; (2) joining the nucleic acid segment to 
the sorting signal; (3) cloning by insertion into a vector compatible with the Gram- 
positive bacterium in which expression is to take place; and (4) incorporation of the 
vector including the new chimeric nucleic acid segment into the bacterium. 

30 Typically, the nucleic acid segment encoding the protein to be sorted is 

DNA; however, the use of RNA in certain cloning steps is within the scope of the 
present invention. 

When dealing with genes from eukaryotic organisms, it is preferred to 
use cDNA, because the natural gene typically contains intervening sequences or 

35 introns that are not translated. Alternatively, if the amino acid sequence is known, a 
synthetic gene encoding the protein to be sorted can be constructed by standard solid- 
phase oligodeoxy ribonucleotide synthesis methods, such as the phosphotriester or 



WO 99/09145 



PCT/US98/16229 



-25- 

phosphite triester methods. The sequence of the synthetic gene is determined by the 
genetic code, by which each naturally occurring amino acid is specified by one or 
more codons. Additionally, if a portion of the protein sequence is known, but the 
gene or messenger RNA has not been isolated, the amino acid sequence can be used to 
5 construct a degenerate set of probes according to the known degeneracy of the genetic 
code. General aspects of cloning are described, for example, in J. Sambrook et al., 
"Molecular Cloning: A Laboratory Manual" (2d ed., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York, 1989); in B. Perbal, "A Practical Guide to 
Molecular Cloning" (2d ed., John Wiley & Sons, New York 1988), in S.L. Berger & 

10 A.R. Kimmel, "Guide to Molecular Cloning Techniques" (Methods in Enzymology, 
vol. 152, Academic Press, Inc., San Diego, 1987), and inD.V. Goeddel, ed., "Gene 
Expression Technology" (Methods in Enzymology, vol. 185, Academic Press, Inc., 
San Diego, 1991). 

Once isolated, DNA encoding the protein to be sorted is then joined to 

15 the sorting signal. This is typically accomplished through ligation, such as using 

Escherichia coli or bacteriophage T4 ligase. Conditions for the use of these enzymes 
are well known and are described, for example, in the above general references. 

The ligation is done in such a way so that the protein to be sorted and 
the sorting signal are joined in a single contiguous reading frame so that a single 

20 protein is produced. This may, in some cases, involve addition or deletion of bases of 
the cloned DNA segment to maintain a single reading frame. This can be done by 
using standard techniques. 

Cloning is typically performed by inserting the cloned DNA into a 
vector containing control elements to allow expression of the cloned DNA. The 

25 vector is then incorporated into the bacterium in which expression is to occur, using 
standard techniques of transformation or other techniques for introducing nucleic 
acids into bacteria. 

One suitable cloning system for S. aureus places the cloned gene under 
the control of the BlaZRI regulon (P.Z. Wang et al., Nucl. Acids Res. 19:4000 

30 (1991)). Vectors and other cloning techniques for use in Staphylococcus aureus are 
described in B. Nilsson & L. Abrahmsen, "Fusion to Staphylococcal Protein A," in 
Gene Expression Technology , supra , p. 144-1 61. 

If the chimeric protein is cloned under control of the BlaZRI regulon, 
expression can be induced by the addition of the (3-lactam antibiotic methicillin. 

35 Another aspect of the present invention is a polypeptide displayed on 

the surface of a Gram-positive bacterium by covalent linkage of an amino— acid 
sequence of LPX3X4 derived from cleavage of an LPX3X4G motif, as described above. 
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Yet another aspect of the present invention is a covalent complex 
comprising: (1) the displayed polypeptide; and (2) an antigen or hapten covalently 
cross-linked to the polypeptide. 



5 B. Screening Methods 

These polypeptides associated with the cell surfaces of Gram-positive 
bacteria can be used in various ways for screening. For example, samples of 
expressed proteins from an expression library containing expressed proteins on the 
surfaces of the cells can be used to screen for clones that express a particular desired 
10 protein when a labeled antibody or other labeled specific binding partner for that 
protein is available. 

These methods are based on the methods for protein targeting and 
display described above. 

A first embodiment of such a method comprises: (1) expressing a 

15 cloned polypeptide as a chimeric protein having a sorting signal at its carboxy- 
terminal end as described above; (2) forming a reaction mixture including: (i) the 
expressed chimeric protein; (ii) a substantially purified sortase-transamidase enzyme; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; (3) binding of the 

20 chimeric protein covalently to the cell wall by the enzymatic action of a sortase- 
transamidase expressed by the Gram-positive bacterium involving cleavage of the 
chimeric protein within the LPX3X4G motif so that the polypeptide is displayed on the 
surface of the Gram-positive bacterium in such a way that the polypeptide is 
accessible to a ligand; and (4) reacting the displayed polypeptide with a labeled 

25 specific binding partner to screen the chimeric protein for reactivity with the labeled 
specific binding partner. 

The nucleic acid segment encoding the chimeric protein is formed by 
methods well known in the art and can include a spacer. 

In the last step, the cells are merely exposed to the labeled antibody or 

30 other labeled specific binding partner, unreacted antibodies removed as by a wash, and 
label associated with the cells detected by conventional techniques such as 
fluorescence, chemiluminescence, or autoradiography. 

A second embodiment of this method employs expression in a Gram- 
positive bacterium that also produces a sortase-transamidase enzyme. This method 

35 comprises: (1) cloning a nucleic acid segment encoding a chimeric protein into a 
Gram-positive bacterium to generate a cloned chimeric protein including therein a 
carboxyl-terminal sorting signal as described above, the chimeric protein including 
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the polypeptide whose expression is to be screened; (2) growing the bacterium into 
which the nucleic acid segment has been cloned to express the cloned chimeric protein 
to generate a chimeric protein including therein a carboxyl-terminal sorting signal; (3) 
binding the polypeptide covalently to the cell wall by the enzymatic action of a 
5 sortase-transamidase expressed by the Gram-positive bacterium involving cleavage 
of the chimeric protein within the LPX3X4G motif so that the polypeptide is displayed 
on the surface of the Gram-positive bacterium in such a way that the polypeptide is 
accessible to a ligand; and (4) reacting the displayed polypeptide with a labeled 
specific binding partner to screen the chimeric protein for reactivity with the labeled 
1 0 specific binding partner. 

V. USE OF SORTED MOLECULES FOR DIAGNOSIS AND TREATMENT OF 
BACTERIAL INFECTIONS 

Sorted molecules can also be used for the diagnosis and treatment of 

15 bacterial infections caused by Gram-positive bacteria. Antibiotic molecules or 
fluorescent or any other diagnostic molecules can be chemically linked to a sorted 
peptide segment, which may include a spacer as described above, and then can be 
injected into animals or humans. These molecules are then sorted by the sortase- 
transamidase so that they are covalently linked to the cell wall of the bacteria. 

20 In general, these methods comprise: (1) conjugating an antibiotic or a 

detection reagent to a protein including therein a carboxyl-terminal sorting signal to 
produce a conjugate; and (2) introducing the conjugate to an organism infected with a 
Gram-positive bacterium in order to cause the conjugate to be sorted and covalently 
cross-linked to the cell walls of the bacterium in order to treat or diagnose the 

25 infection. 

The antibiotic used can be, but is not limited to, a penicillin, ampicillin, 
vancomycin, gentamicin, streptomycin, a cephalosporin, amikacin, kanamycin, 
neomycin, paromomycin, tobramycin, ciprofloxacin, clindamycin, rifampin, 
chloramphenicol, or norfloxacin, or a derivative of these antibiotics. 

30 The detection reagent is typically an antibody or other specific binding 

partner labeled with a detectable label, such as a radiolabel. Such methods are well 
known in the art and need not be described further here. 

Accordingly, another aspect of the present invention is a conjugate 
comprising an antibiotic or a detection reagent covalently conjugated to a protein 

35 including therein a carboxyl-terminal sorting signal as described above to produce a 
conjugate. 
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Yet another aspect of the present invention is a composition 
comprising the conjugate and a pharmaceutically acceptable carrier. 

In this context, the conjugates can be administered using conventional 
modes of administration, including, but not limited to, intravenous, intraperitoneal, 
5 oral, or intralymphatic. Other routes of administration can alternatively be used. Oral 
or intraperitoneal administration is generally preferred. The composition can be 
administered in a variety of dosage forms, which include, but are not limited to, liquid 
solutions or suspensions, tablets, pills, powders, suppositories, polymeric 
microcapsules or microvesicles, liposomes, and injectable or infusible solutions. The 

10 preferred form depends on the mode of administration and the quantity administered. 

The compositions for administration preferably also include 
conventional pharmaceutically acceptable carriers and adjuvants known in the art such 
as human serum albumin, ion exchangers, alumina, lecithin, buffered substances such 
as phosphate, glycine, sorbic acid, potassium sorbate, and salts or electrolytes such as 

15 protamine sulfate. The most effective mode of administration and dosage regimen for 
the conjugates as used in the methods in the present invention depend on the severity 
and course of the disease, the patient's health, the response to treatment, the particular 
strain of bacteria infecting the patient, other drugs being administered and the 
development of resistance to them, the accessibility of the site of infection to blood 

20 flow, pharmacokinetic considerations such as the condition of the patient's liver 
and/or kidneys that can affect the metabolism and/or excretion of the administered 
conjugates, and the judgment of the treating physician. According, the dosages should 
be titrated to the individual patient. 

25 VI. USE OF SORTED POLYPEPTIDES FOR PRODUCTION OF VACCINES 

Additionally, the sorted polypeptides covalently crosslinked to the cell 
walls of Gram-positive bacteria according to the present invention have a number of 
uses. One use is use in the production of vaccines that can be used to generate 
immunity against infectious diseases affecting mammals, including both human and 

30 non-human mammals, such as cattle, sheep, and goats, as well as other animals such 
as poultry and fish. This invention is of special importance to mammals. The 
usefulness of these complexes for vaccine production lies in the fact that the proteins 
are on the surface of the cell wall and are accessible to the medium surrounding the 
bacterial cells, so that the antigenic part of the chimeric protein is accessible to the 

35 antigen processing system. It is well known that presenting antigens in particulate 
form greatly enhances the immune response. In effect, bacteria containing antigenic 
peptides on the surfaces linked to the bacteria by these covalent interactions function 
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as natural adjuvants. Here follows a representative list of typical microorganisms that 
express polypeptide antigens against which useful antibodies can be prepared by the 
methods of the present invention: 

(1) Fungi: Candida albicans, Aspergillus fumigatus, Histoplasma 
5 capsulatum (all cause disseminating disease), Microsporum canis (animal ringworm). 

(2) Parasitic protozoa: (1) Plasmodium falciparum (malaria), 
Trypanosoma cruzei (sleeping sickness). 

(3) Spirochetes: (1) Borrelia bergdorferi (Lyme disease), Treponema 
pallidum (syphilis), Borrelia recurrentis (relapsing fever), Leptospira 

10 icterohaemorrhagiae (leptospirosis). 

(4) Bacteria: Neisseria gonorrhoeae (gonorrhea), Staphylococcus 
aureus (endocarditis), Streptococcus pyogenes (rheumatic fever), Salmonella typhosa 
(salmonellosis), Hemophilus influenzae (influenza), Bordetella pertussis (whooping 
cough), Actinomyces israelii (actinomycosis), Streptococcus mutans (dental caries), 

15 Streptococcus equi (strangles in horses), Streptococcus agalactiae (bovine mastitis), 
Streptococcus anginosus (canine genital infections). 

(5) Viruses: Human immunodeficiency virus (HIV), polio virus, 
influenza virus, rabies virus, herpes virus, foot and mouth disease virus, psittacosis 
virus, paramyxovirus, myxovirus, coronavirus. 

20 Typically, the resulting immunological response occurs by both 

humoral and cell-mediated pathways. One possible immunological response is the 
production of antibodies, thereby providing protection against infection by the 
pathogen. 

This method is not limited to protein antigens. As discussed below. 

25 non-protein antigens or haptens can be covalently linked to the C-terminal cell-wall 
targeting segment, which can be produced as an independently expressed polypeptide, 
either alone, or with a spacer at its amino-terminal end. If a spacer at the amino- 
terminal end is used, typically the spacer will have a conformation allowing the 
efficient interaction of the non-protein antigen or hapten with the immune system, 

30 most typically a random coil or a-helical form. The spacer can be of any suitable 

length; typically, it is in the range of about 5 to about 30 amino acids; most typically, 
about 10 to about 20 amino acids. In this version of the embodiment, the 
independently expressed polypeptide, once expressed, can then be covalently linked to 
the hapten or non-protein antigen. Typical non-protein antigens or haptens include 

35 drugs, including both drugs of abuse and therapeutic drugs, alkaloids, steroids, 

carbohydrates, aromatic compounds, including many pollutants, and other compounds 
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that can be covalently linked to protein and against which an immune response can be 
raised. 

Alternatively, a protein antigen can be covalently linked to the 
independently expressed cell-wall targeting segment or a cell-wall targeting segment 
5 including a spacer. 

Many methods for covalent linkage of both protein and non-protein 
compounds to proteins are well known in the art and are described, for example, in P. 
Tijssen, "Practice and Theory of Enzyme Immunoassays" (Elsevier, Amsterdam, 
1985), pp. 221-295, and in S.S. Wong, "Chemistry of Protein Conjugation and 

10 Cross-Linking" (CRC Press, Inc., Boca Raton, FL, 1993). 

Many reactive groups on both protein and non-protein compounds are 
available for conjugation. 

For example, organic moieties containing carboxyl groups or that can 
be carboxylated can be conjugated to proteins via the mixed anhydride method, the 

15 carbodiimide method, using dicyclohexylcarbodiimide, and the N— 
hydroxy succinimide ester method. 

If the organic moiety contains amino groups or reducible nitro groups 
or can be substituted with such groups, conjugation can be achieved by one of several 
techniques. Aromatic amines can be converted to diazonium salts by the slow 

20 addition of nitrous acid and then reacted with proteins at a pH of about 9. If the 

organic moiety contains aliphatic amines, such groups can be conjugated to proteins 
by various methods, including carbodiimide, tolylene-2,4-diisocyanate, or malemide 
compounds, particularly the N-hydroxysuccinimide esters of malemide derivatives. 
An example of such a compound is 4-(N-maleimidomethyl)-cyclohexane-l- 

25 carboxylic acid. Another example is m-maleimidobenzoyl-N-hydroxysuccinimide 
ester. Still another reagent that can be used is N-succinimidyl— 3-(2-pyridyldithio) 
propionate. Also, bifunctional esters, such as dimethylpimelimidate, 
dimethyladipimidate, or dimethylsuberimidate, can be used to couple amino-group- 
containing moieties to proteins. 

30 Additionally, aliphatic amines can also be converted to aromatic 

amines by reaction with p-nitrobenzoylchloride and subsequent reduction to a j>— 
aminobenzoylamide, which can then be coupled to proteins after diazotization. 

Organic moieties containing hydroxyl groups can be cross-linked by a 
number of indirect procedures. For example, the conversion of an alcohol moiety to 

35 the half ester of succinic acid (hemisuccinate) introduces a carboxyl group available 
for conjugation. The bifunctional reagent sebacoyldichloride converts alcohol to acid 
chloride which, at pH 8.5, reacts readily with proteins. Hydroxyl-containing organic 
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moieties can also be conjugated through the highly reactive chlorocarbonates, 
prepared with an equal molar amount of phosgene. 

For organic moieties containing ketones or aldehydes, such carbonyl- 
containing groups can be derivatized into carboxyl groups through the formation of 
5 O-(carboxymethyl) oximes. Ketone groups can also be derivatized with p- 
hydrazinobenzoic acid to produce carboxyl groups that can be conjugated to the 
specific binding partner as described above. Organic moieties containing aldehyde 
groups can be directly conjugated through the formation of Schiff bases which are 
then stabilized by a reduction with sodium borohydride. 

10 One particularly useful cross-linking agent for hydroxy 1-containing 

organic moieties is a photosensitive noncleavable heterobifunctional cross-linking 
reagent, sulfosuccinimidyl 6-[4'-azido-2'-nitrophenylamino] hexanoate. Other 
similar reagents are described in S.S. Wong, "Chemistry of Protein Conjugation and 
Cross-Linking," supra . 

15 Other cross-linking reagents can be used that introduce spacers 

between the organic moiety and the specific binding partner. 

These methods need not be described further here. 

VII. PRODUCTION OF SUBSTANTIALLY PURIFIED SORTASE- 
20 TRANSAMIDASE ENZYME 

Another aspect of the present invention is methods for the production 
of substantially purified sortase— transamidase enzyme. 

A. Methods Involving Expression of Cloned Gene 
25 One method for the production of substantially purified sortase- 

transamidase enzyme involves the expression of the cloned gene. The isolation of the 
nucleic acid segment or segments encoding the sortase-transamidase enzyme is 
described above; these nucleic acid segment or segments are then incorporated into a 
vector and then use to transform a host in which the enzyme can be expressed. In one 
30 alternative, the host is a Gram-positive bacterium. 

The next step in this alternative is expression in a Gram-positive 
bacterium to generate the cloned sortase-transamidase enzyme. Expression is 
typically under the control of various control elements associated with the vector 
incorporating the DNA encoding the sortase-transamidase gene; such elements can 
35 include promoters and operators, which can be regulated by proteins such as 

repressors. The conditions required for expression of cloned proteins in gram- 
positive bacteria, particularly S. aureus, are well known in the art and need not be 
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further recited here. An example is the induction of expression of lysostaphin under 
control of the BlaZRI regulon induced by the addition of methicillin. 

When expressed in Staphylococcus aureus, the chimeric protein is 
typically first exported with an amino-terminal leader peptide, such as the 
5 hydrophobic signal peptide at the amino-terminal region of the cloned lysostaphin of 
Recsei et al. (P. Recsei et al., "Cloning, Sequence, and Expression of the Lysostaphin 
Gene from Staphylococcus simulans" Proc. Natl. Acad. Sci. USA 84: 1 1 27-1 131 
(1987)). 

Alternatively, the cloned nucleic acid segment encoding the sortase- 
10 transamidase enzyme can be inserted in a vector that contains sequences allowing 
expression of the sortase-transamidase in another organism, such as E. coli or S. 
typhimurium. A suitable host organism can then be transformed or transfected with 
the vector containing the cloned nucleic acid segment. Expression is then performed 
in that host organism. 

15 The expressed enzyme is then purified using standard techniques. 

Techniques for the purification of cloned proteins are well known in the art and need 
not be detailed further here. One particularly suitable method of purification is 
affinity chromatography employing an immobilized antibody to sortase. Other protein 
purification methods include chromatography on ion-exchange resins, gel 

20 electrophoresis, isoelectric focusing, and gel filtration, among others. 

One particularly useful form of affinity chromatography for 
purification of cloned proteins, such as sortase-transamidase, as well as other 
proteins, such as glutathione S-transferase and thioredoxin, that have been extended 
with carboxyl-terminal histidine residues, is chromatography on a nickel-sepharose 

25 column. This allows the purification of a sortase-transamidase enzyme extended at 
its carboxyl terminus with a sufficient number of histidine residues to allow specific 
binding of the protein molecule to the nickel-sepharose column through the histidine 
residues. The bound protein is then eluted with imidazole. Typically, six or more 
histidine residues are added; preferably, six histidine residues are added. One way of 

30 adding the histidine residues to a cloned protein, such the sortase-transamidase, is 
through PCR with a primer that includes nucleotides encoding the histidine residues. 
The histidine codons are CAU and CAC expressed as RNA, which are CAT and CAC 
as DNA. Amplification of the cloned DNA with appropriate primers will add the 
histidine residues to yield a new nucleic acid segment, which can be recloned into an 

35 appropriate host for expression of the enzyme extended with the histidine residues. 
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B. Other Methods 

Alternatively, the sortase-traasamidase can be purified from Gram- 
positive bacteria by standard methods, including precipitation with reagents such as 
ammonium sulfate or protamine sulfate, ion-exchange chromatography, gel filtration 
5 chromatography, affinity chromatography, isoelectric focusing, and gel 
electrophoresis, as well as other methods known in the art. 

Because the sortase-transamidase is a cysteine protease, one 
particularly useful method of purification involves covalent chromatography by thiol- 
disulfide interchange, using a two-protonic— state gel containing a 2— 
10 mercaptopyridine leaving group, such as Sepharose 2B-glutathione 2-pyridyl 
disulfide or Sepharose 6B-hydroxypropyl 2^>yridyl disulfide. Such covalent 
chromatographic techniques are described in K. Brocklehurst et al., "Cysteine 
Proteases," in New Comprehensive Biochemistry. Volume 16: Hydro lytic Enzymes 
(A. Neuberger & K. Brocklehurst, eds., Elsevier, New York, 1987), ch. 2, pp. 39-158. 

15 

VIII. FURTHER APPLICATIONS OF SORTASE-TRANSAMIDASE 

A. Production of Antibodies 

Antibodies can be prepared to the substantially purified sortase- 
transamidase of the present invention, whether the sortase-transamidase is purified 

20 from bacteria or produced from recombinant bacteria as a result of gene cloning 
procedures. Because the substantially purified enzyme according to the present 
invention is a protein, it is an effective antigen, and antibodies can be made by well- 
understood methods such as those disclosed in E. Harlow & D. Lane, "Antibodies: A 
Laboratory Manual" (Cold Spring Harbor Laboratory, 1988). In general, antibody 

25 preparation involves immunizing an antibody-producing animal with the protein, with 
or without an adjuvant such as Freund's complete or incomplete adjuvant, and 
purification of the antibody produced. The resulting polyclonal antibody can be 
purified by techniques such as affinity chromatography. 

Once the polyclonal antibodies are prepared, monoclonal antibodies 

30 can be prepared by standard procedures, such as those described in Chapter 6 of 
Harlow & Lane, supra . 

B. Derivatives for Affinity Chromatography 

Another aspect of the present invention is derivatives of the cloned, 
35 substantially purified sortase-transamidase of the present invention extended at its 
carboxyl terminus with a sufficient number of histidine residues to allow specific 
binding of the protein molecule to a nickel-sepharose column through the histidine 
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residues. Typically, six or more histidine residues are added; preferably, six histidine 
residues are added. 

The histidine residues can be added to the carboxyl terminus through 
PCR cloning as described above. 
5 This invention is further described by means of the following example. 

This Example is for illustrative purposes only, and are not to be construed as limiting 
the scope of the invention in any manner. 

EXAMPLE 

10 Identification of Sortase-Transamidase 

Generation of ts Mutants through Chemical Mutagenesis 

To create random mutations in the chromosome, Staphylococcus 
aureus strain OS2 (RN4220 erm spa-) was mutagenized by exposure to the DNA- 
modifying agent N-methyl-N-nitro-N-nitrosoguanidine. Cultures were incubated 

15 with the mutagen for varying periods of time, then placed on TSB agar plates to 

measure viability. Cultures were subsequently plated on TSB+ rifampicin (10 jig/ml) 
to determine the mutation frequency based on resistance to the single target site 
antibiotic. Once a maximum mutation frequency was reached, cell cultures were 
exposed to two successive rounds of a penicillin selection (5 jag/ml at 42°C) to enrich 

20 for mutants that had a growth effect by lysis of the cells growing at this temperature. 
Mutants were screened for growth at 30°C and 42°C by streaking individual colonies 
on TSB agar plates at the permissive and non-permissive temperatures, respectively. 
These colonies that demonstrated a growth defect at the non-permissive temperature 
were rechecked at 42°C, and subsequently frozen at -80°C in a 5% BSA, 5% 

25 monosodium glutamate (MSG) solution. In this manner, a collection of temperature- 
sensitive mutants was assembled. 



Transformation and Screening of ts Mutants 

In order to isolate mutants that demonstrated a defect in the surface 

30 display of protein, it was necessary to develop a screening process to locate these 

strains. Previous studies that had indicated a typical secretory-dependent process, in 
conjunction with known C-terminal cleavage of translocated proteins, were used to 
elucidate a selection scheme to isolate the desirable mutants. The construct harboring 
the staphylococcal enterotoxin B (SEB) gene fused to the cell wall sorting signal of 

35 staphylococcal protein A (SPA) was used in this assay. This reporter molecule has 

been shown to be properly processed not only by the secretory machinery and through 
signal peptidase cleavage of an N-terminal secretion signal, but also to be correctly 
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sorted after secretion to its peptidoglycan substrate. Mutant cells were made 
competent using 0.5 M sucrose washes, and were transformed with the 
staphylococcal/Escherichia coli shuttle vector pOS 1 containing the SEB-SPA 
reporter through electroporation. Transformants were selected by virtue of their 
5 resistance to chloramphenicol as encoded by the plasmid. The cells were then 

screened for properties indicative of a defect in the processing of precursor molecules 
to a fully matured and anchored surface protein. 

Cultured cells were induced at 42°C, and then pulsed with S-35 
ProMix (80% Met, 20% Cys), in order to label all synthesized proteins. Samples were 

10 precipitated through acid treatment, then digested with lysostaphin (100 fig/ml) and 
subsequently reprecipitated. This was followed by solubilization in hot sodium 
dodecyl sulfate (4%), and an immunoprecipitation with anti-SEB antibodies. 
Samples were run on sodium dodecyl sulfate-polyacrylamide gel electrophoresis and 
finally exposed to phosphorimager evaluation. The resulting banding patterns were 

1 5 analyzed and quantitated. 

This procedure yielded three anti-SEB reacting species, termed PI, P2 
and M. PI, the largest precursor which migrated to 33 kDa, represents the complete 
gene product encoded by the SEB-SPA construct, with no modifications. P2, the 
second precursor, most likely represents the product after cleavage of the amino- 

20 terminal signal peptide found in SEB, and thus migrates to 32 kDa. The smallest 
species, M, is a lysostaphin-solubilized, maturely anchored peptide that has neither 
the signal peptide nor the remainder of the carboxyl-terminal sorting signal after the 
cleavage. This band migrates at approximately 29 kDa. 

Analysis of these species was conducted through phosphorimager 

25 quantitation, and mutants were selected based upon the proposed phenotype that the 
inhibition in sorting would result in an accumulation of P2 and a reduction in the 
production of M. Through this process, two mutants, SM3 1 7 and SM329, were 
earmarked for further analysis due to their elevated ratio of P2/M over wild-type. 

Both mutants finally demonstrated an accumulation of P2 after a 5 min 

30 kinetic analysis, but also clearly showed a decrease in the mutant's ability to degrade 
the species over time as measured by samples chased with cold methionine. These 
results were interpreted to mean that the ability of the mutants to process mature cell 
wall anchored peptides was impaired, quite possibly due to the less efficient activity 
of the sortase-transamidase enzyme. 

35 Protoplasts were made of these mutants in order to cure them of their 

plasmids encoding the reporter construct, and subsequently retransformed with the 
SEB/SPA containing plasmid to once again test for preservation of this phenotype. 
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Upon a favorable result, the mutants were prepared for complementation by a S. 
aureus chromosomal library. 

Generation of an S. aureus Chromosomal Library and Complementation ofts Defect 
5 A staphylococcal library was made through a Sau3 AI digest of the 

chromosomal DNA preparation from S. aureus strain RN4220. DNA was isolated 
through a phenol-chloroform extraction from lysed cells, and digested for various 
times until the correct partial digest pattern was observed. Fragments greater than 2.5 
kb were inserted into the BamHI cloning site in the multi-cloning sequence (MCS) of 
10 plasmid pC194-MCS. This heterogeneous mixture of plasmids was then transformed 
into competent OS2 cells. Approximately 15,000 clones were harvested. DNA was 
prepared and transformed into competent cells made of both mutants, and 
simultaneously plated at 30°C and 42°C to screen for complementation of the ts 
mutant phenotype. 

15 Through this process, four chromosomal inserts from each mutant were 

found to complement the ts phenotype by conferring growth at 42°C. Due to the 
nature of the mutagenesis, it is at this point necessary to demonstrate definitively that 
the ts defect is somehow linked to the defect in processing. This is done through 
illustration that the plasmids harboring the chromosomal inserts not only complement 

20 the temperature sensitivity, but also relieve the accumulation of P2 at the expense of 
M in these mutants. Therefore, the complemented mutants were screened along with 
the non-complemented versions against wild type OS2. 

Screen for Sorting Defect Complementation 

25 This assay was conducted differently for each of the two mutants. 

SM-3 1 7 was screened by the insertion of the SEB/SPA fragment aboard the 
replication defective pCL 84 vector that possessed integration capability into the 
chromosome of S. aureus cells. The site-specific integration, mediated by the 
integrase gene supplied in trans by pCLl 12, disrupts the lipase gene, which can be 

30 assayed for by the lack of hydrolysis on egg yolk agar plates. Once successfully 

integrated, the RN4220 chromosomal fragments that complement the ts mutation can 
be added to make the cells ready for screening. 

SM329 was assayed by another approach. The pC194 plasmid 
harboring the complementing stretch of DNA was fused to an E. colt replicon 

35 pHSG399 that contained the SEB/SPA gene. The shuttle vector thus provided both a 
reporter substrate as well as a complementing activity. 
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The result of this screen demonstrated that over several time periods of 
the pulse-chase, the complementing insert added to the mutant reduced the accretion 
of P2 from the elevated level characteristic of the mutant to the wild type range in 
both mutants, the more dramatically in SM-329. This may be due to the fact that the 
5 substrate for sorting in SM-3 1 7 is found in only one copy per cell, whereas in SM- 
329, the reporter aboard pC194 is present at approximately 15 copies. Nevertheless, 
these results indicate that these mutants are in the sortase-transamidase gene and the 
sequencing of both chromosomal inserts was therefore undertaken. It should be 
pointed out that the complementing activity of each of the respective mutants was not 
10 transferable to the other, neither in terms of temperature sensitivity nor for the 

processing defect. Also, the four complementing clones isolated from each mutant 
seemed to behave in an identical manner, and also seemed to possess very similar 
restriction sites when digested with various specific endonucleases. Therefore, one 
clone was chosen from each mutant for sequencing. 

15 

Sequencing and Characterization of the S. aureus Complementing Determinants 

The chromosomal inserts carrying the sorting defect complementing 
capabilities were sequenced. This work was done completely by automated sequence 
analysis using dideoxyribonucleotides. Sequence data was confirmed by duplicate 

20 analysis of both strands of DNA. Comparison was done to all known nucleotide and 
protein sequence was currently found in the GenBank service. The partial, crude 
sequence of the S. aureus gene is shown in Figure 10 (SEQ ID NO: 28 & 29). The 
partial carboxy-terminal amino acid sequence of the open reading frame generated 
from the gene sequence of Figure 10 (SEQ ID NO: 28 & 29) is shown in Figure 1 1 

25 (SEQ ID NO: 2). 

Several stretches of high homology were found, to both known and 
putative proteins of varying function. The first 364 bases of the SM-3 17 
complementing gene insert been identified as encoding a protein that is a homologue 
of a putative Bacillus peptidase in the GCVT-SPOIIIAA intergenic region (GenBank 

30 Accession No. 1731048; Y. Kobayashi et al.). The sequence of this putative peptidase 
is shown in Figure 5 (SEQ ID NO: 3) and its hydrophobicity profile is shown in 
Figure 6. The hydrophobicity is calculated according to the method of J. Kyte & R.F. 
Doolittle, "A Simple Method for Displaying the Hydropathic Character of a Protein," 
J. Mol. Biol. 157: 105-132 (1982). To a lesser degree of homology, the protein 

35 encoded by this complementing gene insert is homologous to aminopeptidase P of 
Lactococcus lactis (GenBank Accession No. 1915907; J. Matos). The amino acid 
sequence of this aminopeptidase is shown in Figure 7 (SEQ ID NO: 4). To a still 
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lesser degree of homology, the protein encoded by this complementing gene insert is 
homologous to the proline dipeptidase of Lactobacillus delbrueckii lactis (GenBank 
Accession No. 1 172066; K. Stucky et al., "Cloning and DNA Sequence Analysis of 
pepQ, a Prolidase Gene from Lactobacillus delbrueckii subsp. lactis and Partial 
5 Characterization of Its Product," MoL Gen. Genet. 247: 494-500 (1995). The amino 
acid sequence of this proline dipeptidase is shown in Figure 8 (SEQ ID NO: 5). 

An additional complementing gene insert, in a vector designated 
pCOMPl, has also been sequenced. The DNA sequence of this complementing gene 
insert is shown in Figure 12 (SEQ ID NO: 30), together with the amino acid sequence 

10 (SEQ ID NO: 3 1) of the protein translated from this DNA sequence, and the 

hydrophobicity profile of the protein translated from the DNA sequence is shown in 
Figure 13. The amino acid sequence of the protein translated from the sequence of 
Figure 12 (SEQ ID NOS: 30 & 31) has virtually no homology with the amino acid 
sequence of the protein shown in Figure 1 1 (SEQ ID NO: 2). In particular, the amino 

15 acid sequence of the protein of Figure 12 (SEQ ID NOS: 30 & 31) has a single 
cysteine residue. 

Although Applicants do not intend to be bound by this theory, the 
existence of these two complementing inserts that define different polypeptide 
sequences suggests that the sortase-transamidase enzyme of S. aureus is a 

20 heterooligomer, with two or more different subunits that have different amino acid 
sequences. Mutations in at least two different subunits can give rise to the 
temperature sensitive phenotype and can then be complemented for. Alternatively, the 
synthesis of the peptidoglycan may require additional enzymes. 

Upon completion of the sequencing, and a study of all open reading 

25 frames, candidate genes were selected for further analysis. These genes were 

expressed in the mutants to determine if they complement both the ts and sorting 
defects. Upon success in this capacity, the genes were disrupted in wild-type S. 
aureus to determine their essentiality and possible biological roles. 

30 Materials and Methods 

Mutagenesis of S. aureus Strain OS2 . S. aureus strain OS2 (RN4220 

erm spa-) was mutagenized by treatment with N-methyl-N-nitro-N- 

nitrosoguanidine at 2 mg/ml. Culture OD was measured at 660 nm for viability. 

After 45 min, cultures were spun down at 4,000 rpm for 10 min, and washed with 
35 citrate buffer, pH 5.5. Cells were resuspended in citrate buffer to a concentration of 

5x1 0 8 cells/ml. These cultures were serially diluted in TSB (tryptic soy broth) and 
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plated on TSB+ rifampicin (10 jag/ml) agar plates. Mutation frequencies were 
determined to be 5x10 mutation/cfu. 

Enrichment and Selection of ts Mutants . A 1 : 1 00 dilution of cell 
cultures grown overnight at 30°C was added to TSB, and allowed to grow for 2 hr, at 
5 which time penicillin G was added at 5 jug/ml. Culture viability was measured by 
taking OD readings at 660 nm at various times until the concentration dropped to a 
stable point. Cells were washed twice in TSB, and the enrichment was repeated a 
second time. 

After two successive rounds of penicillin selection for growth arrest at 

10 42°C, individual colonies were picked and simultaneously streaked on duplicate TSB 
plates incubated at 30°C and 42°C. Those colonies that had a growth defect at 42°C 
were rechecked at this nonpermissive temperature, and stored at -80°C in a 5% BSA 
5% MSG solution. 

Transformation of Competent Cells . Mutant cells were made 

15 competent by diluting overnight cultures 1 : 10 in TSB, and growing to 0.3 OD at 660 
nm. Cells were then spun down at 7,500 rpm for 15 min and resuspended in an equal 
volume of 0.5 M sucrose. After another pelleting, cells were resuspended in 0.5 
volume sucrose, and incubated for 30 min at 4°C. Following another spin, cells were 
brought up in 0.1 volume sucrose. These competent cells were then transformed with 

20 the appropriate plasmids encoding chloramphenicol (10 jag/ml) resistance by 

electroporation at 200 ohms, 25 jiF, and 2.5 kV in 0.2 cm cuvettes. Cells were plated 
on TSB plus chloramphenicol incubated at 30°C. Transformed strains were frozen in 
BSA/MSG at -80°C. 

Pulse-Chase Screen of Mutants . Strains were inoculated in chemically 

25 defined media with chloramphenicol and grown overnight at 30°C. Cultures were 
diluted 1:10 into medium IV (O. Schneewind et al., "Sorting of Protein A to the 
Staphylococcal Cell Wall," CeU 70: 267-281 (1992)), grown for 3 hr, and then 
induced at 42°C for 20 min. At this time, cultures were pulse labeled with 50 juCi of 
S-35 ProMix for 5 min, and then terminated with 5% trichloroacetic acid (TCA). 

30 Cells were incubated at 4° for 30 min, centrifuged at 12,500 rpm for 15 min, and the 
supernatants aspirated. After resuspension in acetone, cells were spun again and 
aspirated to dryness. At this time, cells were treated with lysostaphin (100 ]ug/ml) for 
30 min or until noticeable clearing, and subjected to another TC A/acetone 
precipitation. After lysis of cells by boiling for 10 min with 4% SDS in 0.5m Tris, pH 

35 8.0, proteins were immunoprecipitated with anti-SEB for 1 hr and protein A— 

Sepharose beads for another 1 hr. Samples were washed three times in RIPA buffer, 
pH 8.0 (300 mM NaCl, 2% Triton X-100, 1% deoxycholate, 0.2% SDS), and protein 
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was solubilized with SDS-urea sample buffer (50 mM Tris-HCl, pH 6.8, 3% SDS, 
5% 2-mercaptoethanol, 3.5 M urea), with boiling for 10 min. Samples were run on 
SDS-polyacrylamide gel electrophoresis gels and exposed to a phosphorimager screen 
overnight. Quantitations were done on ImageQuant software. 
5 DNA Sequencing . DNA was sequenced on a Perkin-Elmer automated 

sequencer after PCR using dye-terminating ready reaction mixed with SS-Taq 
polymerase. GenBank analysis was done using BLAST software to search the 
database. 

10 ADVANTAGES OF THE PRESENT INVENTION 

In isolating and characterizing the gene for the S. aureus sortase- 
transamidase enzyme, we have determined the existence of a new site for antibiotic 
action that can be used to screen new antibiotics active against Gram-positive 
pathogens, such as Staphylococcus, Actinomyces, Mycobacterium, Streptococcus, 

15 Bacillus, and other medically important Gram-positive pathogens increasingly 
resistant to conventional antibiotics. The availability of substantially purified S. 
aureus sortase-transamidase enzyme provides a method of screening compounds for 
inhibition of the enzyme. 

The purified sortase-transamidase enzyme of the present invention 

20 also yields a method of surface display of peptides and proteins that has advantages 
over phage display, as well as providing methods for producing vaccines against a 
large variety of antigens that can be covalently bound to the surfaces of Gram- 
positive bacteria. 

Although the present invention has been described with considerable 
25 detail, with reference to certain preferred versions thereof, other versions and 

embodiments are possible. Therefore, the scope of the invention is determined by the 
following claims. 
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We claim: 

1 . A substantially purified sortase-transamidase enzyme from a 

5 Gram-positive bacterium, the enzyme catalyzing a reaction that covalently cross- 
links the carboxyl terminus of a protein having a sorting signal to the peptidoglycan of 
a Gram-positive bacterium, the sorting signal having a motif of LPX3X4G therein, 
wherein sorting occurs by cleavage between the fourth and fifth residues of the 
LPX3X4G motif. 

10 

2. The substantially purified sortase-transamidase enzyme of claim 1 
wherein the Gram-positive bacterium is a species selected from the group consisting 
of Staphylococcus aureus , S. sobrinus, Enterococcus faecalis, Streptococcus 
pyogenes, and Listeria monocytogenes. 

15 

3 . The substantially purified sortase-transamidase enzyme of claim 2 
wherein the Gram-positive bacterium is Staphylococcus aureus. 

4. The substantially purified sortase-transamidase enzyme of claim 1 
20 wherein one subunit of the enzyme has a molecular weight of about 41 ,000 daltons. 

5. The substantially purified sortase-transamidase enzyme of claim 4 
wherein the sorting signal further comprises:(2) a substantially hydrophobic domain of 
at least 3 1 amino acids carboxyl to the motif; and (3) a charged tail region with at least 

25 two positively charged residues carboxyl to the substantially hydrophobic domain, at 
least one of the two positively charged residues being arginine, the two positively 
charged residues being located at residues 31-33 from the motif, wherein X 3 is any of 
the twenty naturally-occurring L-amino acids and X4 is selected from the group 
consisting of alanine, serine, and threonine. 

30 

6. The enzyme of claim 1 wherein the enzyme includes therein an 
amino acid sequence selected from the group consisting of :(1) D-P-K-L-K-E-I- 
Y-Q-I-V-L-E-S-Q-M-K-A-I-N-E-I-R-P-G-M-T-G-A-E-A-D-A-I-S- 
R-N-Y-L-E-S-K-G-Y-G-K-E-F-G-H-S-L-G-H-G-I-G-L-E-I-H-E-G- 

35 P-M-L-A-R-T-I-Q-D-K-L-Q-V-N-N-C-V-T-V-E-P-G-V-Y-I-E-G-L- 
G-I-R-I-E-D-D-I-L-I-T-E-N-G-C-Q-V-F-T-K-C-T-K-D-L-I-V-L-T 
(SEQ ID NO: 2); (2) M-V-K-V-T-D-Y-S-N-S-K-L-G-K-E-I-A-P-E-V-L- 
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S-V-I-A-S-I-A-T-S-E-V-E-G-I-T-G-H-F-A-E-L-K-E-T-N-L-E-K-V- 
S-R-K-N-L-S-R-D-L-K-I-E-S-K-E-G-I-Y-I-D-V-Y-C-A-L-K-H-G- 
V-N-I-S-K-T-A-N-K-I-Q-T-S-I-F-N-S-I-S-N-M-T-A-I-E-P-K-Q-I- 
N-I-H-I-T-Q-I-V-I-E-K (SEQ ID NO: 31); and (3) sequences incorporating one 
5 or more conservative amino acid substitutions in SEQ ID NO:2 or SEQ ID NO: 31, 
wherein the conservative amino acid substitutions are any of the following: (1) any of 
isoleucine, leucine, and valine for any other of these amino acids; (2) aspartic acid for 
glutamic acid and vice versa; (3) glutamine for asparagine and vice versa; and (4) 
serine for threonine and vice versa. 

10 

7. The enzyme of claim 6 wherein the amino acid sequence is D— P- 
K-L-K-E-I-Y-Q-I-V-L-E-S-Q-M-K-A-I-N-E-I-R-P-G-M-T-G-A-E- 
A-D-A-I-S-R-N-Y-L-E-S-K-G-Y-G-K-E-F-G-H-S-L-G-H-G-I-G-L- 
E-I-H-E-G-P-M-L-A-R-T-I-Q-D-K-L-Q-V-N-N-C-V-T-V-E-P-G-V- 

1 5 Y-I-E-G-L-G-I-R-I-E-D-D-I-L-I-T-E-N-G-C-Q- V-F-T-K-C-T-K-D- 
L-I-V-L-T (SEQ ID NO: 2). 

8. The enzyme of claim 6 wherein the amino acid sequence is M— V- 
K-V-T-D-Y-S-N-S-K-L-G-K-E-I-A-P-E-V-L-S-V-I-A-S-I-A-T-S-E- 

20 V-E-G-I-T-G-H-F-A-E-L-K-E-T-N-L-E-K-V-S-R-K-N-L-S-R-D-L- 

■ 

K-I-E-S-K-E-G-I-Y-I-D-V-Y-C-A-L-K-H-G-V-N-I-S-K-T-A-N-K-I- 

Q-T-S-I-F-N-S~I-S~N~M-T-A~I-E-P-K-Q-I-N-I-H-I-T-Q-I-V-I-E-K 
(SEQ ID NO: 31). 

25 9. A nucleic acid sequence encoding the enzyme of claim 6. 

10. A nucleic acid sequence encoding the enzyme of claim 7. 

1 1 . A nucleic acid sequence encoding the enzyme of claim 8. 

30 

12. A nucleic acid sequence encoding a substantially purified sortase- 
transamidase enzyme from a Gram-positive bacterium, the enzyme having a subunit 
with a molecular weight of about 4 1 ? 000 daltons and catalyzing a reaction that 
covalently cross-links the carboxyl terminus of a protein having a sorting signal to the 

35 peptidoglycan of a Gram-positive bacterium, the sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
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residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X 3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
5 serine, and threonine, and wherein sorting occurs by cleavage between the fourth and 
fifth residues of the LPX3X4G motif, wherein the nucleic acid sequence includes 
therein a sequence selected from the group consisting of: (1) 

GATCCTAAACTGAAAGAAATATATCAAATAGTACTTGAATCTCAAATGAA 

AGCAATTAATGAGATTAGACCTGGCATGACTGGTGCAGAAGCTGATGCCA 

10 TTTCAAGAAACTATTTAGAGTCAAAAGGGTATGGAAAAGAATTTGGACAT 

TCACTAGGACATGGTATTGGTTTAGAAATCCATGAAGGGCCAATGCTGGC 

TCGTACGATACAAGATAAACTTCAAGTTAACAACTGTGTTACAGTAGAAC 

CTGGTGTTTATATAGAAGGTTTGGGCGGTATAAGAATAGAAGATGATATT 

TTAATTACAGAAAATGGTTGTCAAGTCTTTACTAAATGCACAAAAGACCTT 
15 ATAGTTTTAACATAA (SEQ ID NO: 28); (2) 

ATGGTCAAAGTAACTGATTATTCAAATTCAAAATTAGGTAAAGTAGAAAT 
AGCGCCAGAAGTGCTATCTGTTATTGCAAGTATAGCTACTTCGGAAGTCG 
AAGGCATCACTGGCCATTTTGCTGAATTAAAAGAAACAAATTTAGAAAAA 
GTTAGTCGTAAAAATTTAAGCCGTGATTTAAAAATCGAGAGTAAAGAAGA 
20 TGGCATATATATAGATGTATATTGTGCATTAAAACATGGTAATATTTCAAA 
AACTGCAAACAAAATTCAAACGTCAATTTTTAATTCAATTTCTAATATGAC 
AGCGATAGAACCTAAGCAAATTAATATTCACATTACACAAATCGTTATTG 
AAAAGTAA (SEQ ID NO: 30); or (3) a sequence complementary to SEQ ID NO: 28 
or SEQ ID NO: 30, 

25 

13. A nucleic acid sequence encoding a substantially purified sortase- 
transamidase enzyme from a Gram-positive bacterium, the enzyme having a subunit 
with a molecular weight of about 41,000 daltons and catalyzing a reaction that 
covalently cross-links the carboxyl terminus of a protein having a sorting signal to the 

30 peptidoglycan of a Gram-positive bacterium, the sorting signal having (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 

35 located at residues 31-33 from the motif, wherein X3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine, and wherein sorting occurs by cleavage between the fourth and 
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fifth residues of the LPX3X4G motif, wherein the nucleic acid sequence hybridizes 
with a sequence selected from the group consisting of: (1) 

GATCCTAAACTGAAAGAAATATATCAAATAGTACTTGAATCTCAAATGAA 
AGCAATTAATGAGATTAGACCTGGCATGACTGGTGCAGAAGCTGATGCCA 
5 TTTCAAGAAACTATTTAGAGTCAAAAGGGTATGGAAAAGAATTTGGACAT 
TCACTAGGACATGGTATTGGTTTAGAAATCCATGAAGGGCCAATGCTGGC 
TCGTACGATACAAGATAAACTTCAAGTTAACAACTGTGTTACAGTAGAAC 
CTGGTGTTTATATAGAAGGTTTGGGCGGTATAAGAATAGAAGATGATATT 

TTAATTACAGAAAATGGTTGTCAAGTCTTTACTAAATGCACAAAAGACCTT 
10 ATAGTTTTAACATAA (SEQ ID NO: 28); (2) 

ATGGTCAAAGTAACTGATTATTCAAATTCAAAATTAGGTAAAGTAGAAAT 
AGCGCCAGAAGTGCTATCTGTTATTGCAAGTATAGCTACTTCGGAAGTCG 
AAGGCATCACTGGCCATTTTGCTGAATTAAAAGAAACAAATTTAGAAAAA 
GTTAGTCGTAAAAATTTAAGCCGTGATTTAAAAATCGAGAGTAAAGAAGA 

1 5 TGGC ATATATATAGATGTATATTGTGC ATT AAA AC ATGGTAATATTTC AAA 
AACTGCAAACAAAATTCAAACGTCAATTTTTAATTCAATTTCTAATATGAC 
AGCGATAGAACCTAAGCAAATTAATATTCACATTACACAAATCGTTATTG 
AAAAGTAA (SEQ ID NO: 30) or (3) a sequence complementary to SEQ ID NO: 28 
or SEQ ID NO: 30, with no greater than about a 15% mismatch under stringent 

20 conditions. 

14. The nucleic acid sequence of claim 13 wherein the mismatch is no 
greater than about 5%. 

25 15. The nucleic acid sequence of claim 14 wherein the mismatch is no 

greater than about 2%. 

16. A vector comprising the nucleic acid sequence of claim 9 
operatively linked to at least one control sequence that controls the expression or 

30 regulation of the nucleic acid sequence. 

17. A vector comprising the nucleic acid sequence of claim 10 
operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 



35 
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18. A vector comprising the nucleic acid sequence of claim 1 1 
operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 

5 19. A vector comprising the nucleic acid sequence of claim 12 

operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 

20. A vector comprising the nucleic acid sequence of claim 13 

10 operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 

21 . A host cell transfected with the vector of claim 16. 
15 22. A host cell transfected with the vector of claim 17. 

23. A host cell transfected with the vector of claim 1 8. 



20 



24. A host cell transfected with the vector of claim 19. 



25. A host cell transfected with the vector of claim 20. 



26. A method for producing a substantially purified sortase- 
transamidase enzyme comprising the steps of: 
25 (a) culturing the host cell of claim 21 under conditions in which the 

host cell expresses the encoded sortase-transamidase enzyme; and 

(b) purifying the expressed enzyme to produce substantially purified 
sortase-transamidase enzyme. 

30 27. A method for producing a substantially purified sortase- 

transamidase enzyme comprising the steps of: 

(a) culturing the host cell of claim 22 under conditions in which the 
host cell expresses the encoded sortase-transamidase enzyme; and 

(b) purifying the expressed enzyme to produce substantially purified 
35 sortase-transamidase enzyme. 
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28. A method for producing a substantially purified sortase- 
transamidase enzyme comprising the steps of: 

(a) culturing the host cell of claim 23 under conditions in which the 
host cell expresses the encoded sortase-transamidase enzyme; and 

5 (b) purifying the expressed enzyme to produce substantially purified 

sortase-transamidase enzyme. 

29. A method for producing a substantially purified sortase- 
transamidase enzyme comprising the steps of: 

10 (a) culturing the host cell of claim 24 under conditions in which the 

host cell expresses the encoded sortase-transamidase enzyme; and 

(b) purifying the expressed enzyme to produce substantially purified 
sortase-transamidase enzyme. 

15 30. A method for producing a substantially purified sortase- 

transamidase enzyme comprising the steps of: 

(a) culturing the host cell of claim 25 under conditions in which the 
host cell expresses the encoded sortase-transamidase enzyme; and 

(b) purifying the expressed enzyme to produce substantially purified 
20 sortase-transamidase enzyme. 

3 1 . Substantially purified sortase-transamidase enzyme produced by 
the process of claim 26. 

25 32. Substantially purified sortase-transamidase enzyme produced by 

the process of claim 27. 

33. Substantially purified sortase-transamidase enzyme produced by 
the process of claim 28. 

30 

34. Substantially purified sortase-transamidase enzyme produced by 
the process of claim 29. 

35. Substantially purified sortase-transamidase enzyme produced by 
35 the process of claim 30. 
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36. A method for screening a compound for anti-sortase-transamidase 
activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 1; 

5 (b) performing an assay for sortase-transamidase in the presence and in 

the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

10 

37. A method for screening a compound for anti-sortase-transamidase 
activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 3 ; 

1 5 (b) performing an assay for sortase-transamidase in the presence and in 

the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

20 

38. A method for screening a compound for anti-sortase-transamidase 
activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 3 1 ; 

25 (b) performing an assay for sortase-transamidase in the presence and in 

the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

30 

39. A method for screening a compound for anti-sortase-transamidase 
activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 32; 

35 (b) performing an assay for sortase-transamidase in the presence and in 

the absence of the compound; and 
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(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase^ 
transamidase activity. 

5 40. A method for screening a compound for anti-sortase-transamidase 

activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 33; 

(b) performing an assay for sortase-transamidase in the presence and in 
10 the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

15 41 . A method for screening a compound for anti-sortase-transamidase 

activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 34; 

(b) performing an assay for sortase-transamidase in the presence and in 
20 the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

25 42. A method for screening a compound for anti-sortase-transamidase 

activity comprising the steps of: 

(a) providing the substantially purified sortase-transamidase enzyme of 

claim 3 5 ; 

(b) performing an assay for sortase-transamidase in the presence and in 
30 the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 
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43 . A method for screening a compound for anti— sortase-transamidase 
activity comprising the steps of: 

(a) providing an active fraction of sortase-transamidase enzyme from 
a Gram— positive bacterium; 
5 (b) performing an assay for sortase-transamidase in the presence and in 

the absence of the compound; and 

(c) comparing the activity of the sortase-transamidase enzyme in the 
presence and in the absence of the compound to screen the compound for sortase- 
transamidase activity. 

10 

44. The method of claim 43 wherein the active fraction of sortase- 
transamidase enzyme is a particulate fraction from Staphylococcus aureus. 



45. The method of claim 43 wherein the assay for sortase- 
1 5 transamidase enzyme is performed by monitoring the capture of a soluble peptide that 
is a substrate for the enzyme by its interaction with an affinity resin. 



46. The method of claim 45 wherein the soluble peptide includes a 
sequence of at least six histidine residues and the affinity resin contains nickel. 

20 

47. The method of claim 45 wherein the soluble peptide includes the 
active site of glutathione S-transferase and the affinity resin contains glutathione. 

48. The method of claim 45 wherein the soluble peptide includes the 
25 active site of streptavidin and the affinity resin contains biotin. 

49. The method of claim 45 wherein the soluble peptide includes the 
active site of maltose binding protein and the affinity resin contains amylose. 

30 50. An antibody specifically binding the substantially purified sortase- 

transamidase enzyme of claim 1 . 



5 1 . An antibody specifically binding the substantially purified sortase- 
transamidase enzyme of claim 3 . 

35 

52. An antibody specifically binding the substantially purified sortase- 
transamidase enzyme of claim 3 1 . 
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53. An antibody specifically binding the substantially purified sortase- 
transamidase enzyme of claim 32. 

5 54. An antibody specifically binding the substantially purified sortase- 

transamidase enzyme of claim 33. 

55. An antibody specifically binding the substantially purified sortase- 
transamidase enzyme of claim 34. 

10 

56. An antibody specifically binding the substantially purified sortase- 
transamidase enzyme of claim 35. 

57. A protein molecule comprising the substantially purified sortase- 
15 transamidase enzyme of claim 1 extended at its carboxyl-terminus with a sufficient 

number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column through the histidine residues added at the carboxyl- 
terminus. 

20 58. A protein molecule comprising the substantially purified sortase- 

transamidase enzyme of claim 3 extended at its carboxyl-terminus with a sufficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column through the histidine residues added at the carboxyl- 
terminus. 

25 

59. A protein molecule comprising the substantially purified sortase- 
transamidase enzyme of claim 3 1 extended at its carboxyl-terminus with a sufficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column. 

30 

60. A protein molecule comprising the substantially purified sortase- 
transamidase enzyme of claim 32 extended at its carboxyl-terminus with a sufficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column. 



35 
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61. A protein molecule comprising the substantially purified sortase- 
transamidase enzyme of claim 33 extended at its carboxyl-terminus with a sufficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column. 

5 

62. A protein molecule comprising the substantially purified sortase- 
transamidase enzyme of claim 34 extended at its carboxyl-terminus with a sufficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column. 

10 

63. A protein molecule comprising the substantially purified sortase- 
transamidase enzyme of claim 35 extended at its carboxyl-terminus with a sufficient 
number of histidine residues to allow specific binding of the protein molecule to a 
nickel-sepharose column. 

15 

64. A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

20 substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X 3 is any of the twenty naturally-occurring L-amino acids 

25 and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) the substantially purified sortase-transamidase of claim 1; and (iii) a 
Gram-positive bacterium having a peptidoglycan to which the sortase-transamidase 
can link the polypeptide; and 

30 (c) allowing the sortase-transamidase to catalyze a reaction that 

cleaves the polypeptide within the LPX3X4 motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 



35 
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65. A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

5 substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
10 and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) the substantially purified sortase-transamidase of claim 3; and (iii) a 
Gram-positive bacterium having a peptidoglycan to which the sortase-transamidase 
can link the polypeptide; and 

1 5 (c) allowing the sortase-transamidase to catalyze a reaction that 

cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 

20 

66. A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

25 substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 

30 and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) the substantially purified sortase-transamidase enzyme of claim 31; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide; and 

35 (c) allowing the sortase-transamidase to catalyze a reaction that 

cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
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peptidoglycan to display the polypeptide on the surface of the Gram— positive 
bacterium. 

67. A method for displaying a polypeptide on the surface of a Gram- 
5 positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 
substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 

10 substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 

15 polypeptide; (ii) the substantially purified sortase-transamidase enzyme of claim 32; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide; and 

(c) allowing the sortase-transamidase to catalyze a reaction that 
cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 

20 cross— links the amino— terminal portion of the cleaved polypeptide to the 

peptidoglycan to display the polypeptide on the surface of the Gram— positive 
bacterium. 

68. A method for displaying a polypeptide on the surface of a Gram- 
25 positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 
substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 

30 substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X 3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 

35 polypeptide; (ii) the substantially purified sortase-transamidase enzyme of claim 33; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide; and 



WO 99/09145 



PCT/US98/16229 



-54- 

(c) allowing the sortase— transamidase to catalyze a reaction that 
cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
peptidoglycan to display the polypeptide on the surface of the Gram-positive 
5 bacterium. 



69. A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 
10 terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 31-33 
15 from the motif, wherein X 3 is any of the twenty naturally-occurring L-amino acids 
and X 4 is selected from the group consisting of alanine, serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) the substantially purified sortase-transamidase enzyme of claim 34; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 

20 transamidase can link the polypeptide; and 

(c) allowing the sortase-transamidase to catalyze a reaction that 
cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
peptidoglycan to display the polypeptide on the surface of the Gram-positive 

25 bacterium. 



70. A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) expressing a polypeptide having a sorting signal at its carboxy- 

30 terminal end, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
being arginine, the two positively charged residues being located at residues 3 1-33 

35 from the motif, wherein X 3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 
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(b) forming a reaction mixture including: (i) the expressed 
polypeptide; (ii) the substantially purified sortase-transamidase enzyme of claim 35; 
and (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide; and 
5 (c) allowing the sortase-transamidase to catalyze a reaction that 

cleaves the polypeptide within the LPX3X4G motif of the sorting signal and covalently 
cross-links the amino-terminal portion of the cleaved polypeptide to the 
peptidoglycan to display the polypeptide on the surface of the Gram-positive 
bacterium. 

10 

71. A method for displaying a polypeptide on the surface of a Gram- 
positive bacterium comprising the steps of: 

(a) cloning a nucleic acid segment encoding a chimeric protein into a 
Gram-positive bacterium to generate a cloned chimeric protein including therein a 

15 carboxyl-terminal sorting signal, the chimeric protein including the polypeptide to be 
displayed, the sorting signal having: (1) a motif of LPX3X4G therein; (2) a 
substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 

20 being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X3 is any of the twenty naturally-occurring L-amino acids 
and X4 is selected from the group consisting of alanine, serine, and threonine; 

(b) growing the bacterium into which the nucleic acid segment has 
been cloned to express the cloned chimeric protein to generate a chimeric protein 

25 including therein a carboxyl-terminal sorting signal; and 

(c) binding the polypeptide covalently to the cell wall by the enzymatic 
action of a sortase-transamidase expressed by the Gram-positive bacterium involving 
cleavage of the chimeric protein within the LPX3X4G motif so that the polypeptide is 
displayed on the surface of the Gram-positive bacterium in such a way that the 

30 polypeptide is accessible to a ligand. 

72. A polypeptide displayed on the surface of a Gram-positive 
bacterium by covalent linkage of an amino-acid sequence of LPX3X4 derived from 
cleavage of an LPX3X4G motif, wherein X 3 is any of the twenty naturally-occurring 

35 L-amino acids and X4 is selected from the group consisting of alanine, serine, and 
threonine, the polypeptide being displayed on the surface of the Gram-positive 
bacterium in such a way that the polypeptide is accessible to a ligand. 
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73. A co valent complex comprising: 

(a) the polypeptide of claim 72; and 

(b) an antigen or hapten covalently cross-linked to the polypeptide. 

5 

74. The covalent complex of claim 73 wherein an antigen is covalently 
cross-linked to the polypeptide. 

75. The covalent complex of claim 73 wherein a hapten is covalently 
10 cross-linked to the peptide. 

76. A method for vaccination of an animal comprising the step of 
immunizing the animal with the displayed polypeptide of claim 72 to generate an 
immune response against the displayed polypeptide. 

15 

77. A method for vaccination of an animal comprising the step of 
immunizing the animal with the covalent complex of claim 73 to generate an immune 
response against the antigen or hapten of the covalent complex. 

20 78. A method for screening for expression of a cloned polypeptide 

comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 

25 carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X 3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 

30 serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
protein; the substantially purified sortase-transamidase enzyme of claim 1 ; and (iii) a 
Gram-positive bacterium having a peptidoglycan to which the sortase-transamidase 
can link the polypeptide through the sorting signal; 

35 (c) binding the chimeric protein covalently to the cell wall by the 

enzymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 
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that the polypeptide is displayed on the surface of the Gram— positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
5 partner. 

79. A method for screening for expression of a cloned polypeptide 
comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
10 sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 

LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
15 located at residues 31-33 from the motif, wherein X 3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
protein; (ii) the substantially purified sortase-transamidase enzyme of claim 3; and 

20 (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; 

(c) binding the chimeric protein covalently to the cell wall by the 
enzymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 

25 that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

30 

80. A method for screening for expression of a cloned polypeptide 
comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 
35 LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
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positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X 3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine; 

5 (b) forming a reaction mixture including: (i) the expressed chimeric 

protein; (ii) the substantially purified sortase-transamidase enzyme of claim 3 1 ; and 
(iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; 

(c) binding the chimeric protein covalently to the cell wall by the 
10 enzymatic action of a sortase-transamidase expressed by the Gram-positive 

bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 
that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

(d) reacting the displayed polypeptide with a labeled specific binding 
15 partner to screen the chimeric protein for reactivity with the labeled specific binding 

partner. 



81 . A method for screening for expression of a cloned polypeptide 
comprising the steps of: 

20 (a) expressing a cloned polypeptide as a chimeric protein having a 

sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 

25 positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X 3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
30 protein; (ii) the substantially purified sortase-transamidase enzyme of claim 32; and 

(iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; 

(c) binding the chimeric protein covalently to the cell wall by the 
enzymatic action of a sortase-transamidase expressed by the Gram-positive 

35 bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 
that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 
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(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 



5 82. A method for screening for expression of a cloned polypeptide 

comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 

10 carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X 3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 

15 serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
protein; (ii) the substantially purified sortase-transamidase enzyme of claim 33; and 
(iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; 

20 (c) binding the chimeric protein covalently to the cell wall by the 

enzymatic action of a sortase-transamidase expressed by the Gram— positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 
that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

25 (d) reacting the displayed polypeptide with a labeled specific binding 

partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 



83. A method for screening for expression of a cloned polypeptide 
30 comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
35 residues carboxyl to the substantially hydrophobic domain, at least one of the two 

positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X 3 is any of the twenty naturally- 
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occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
protein; (ii) the substantially purified sortase-transamidase enzyme of claim 34; and 

5 (iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 
transamidase can link the polypeptide through the sorting signal; 

(c) binding the chimeric protein covalently to the cell wall by the 
enzymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 

10 that the polypeptide is displayed on the surface of the Gram-positive bacterium in 
such a way that the polypeptide is accessible to a ligand; and 

(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

15 

84. A method for screening for expression of a cloned polypeptide 
comprising the steps of: 

(a) expressing a cloned polypeptide as a chimeric protein having a 
sorting signal at its carboxy-terminal end, the sorting signal having: (1) a motif of 

20 LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 
residues carboxyl to the substantially hydrophobic domain, at least one of the two 
positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X 3 is any of the twenty naturally- 

25 occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine; 

(b) forming a reaction mixture including: (i) the expressed chimeric 
protein; (ii) the substantially purified sortase-transamidase enzyme of claim 35; and 
(iii) a Gram-positive bacterium having a peptidoglycan to which the sortase- 

30 transamidase can link the polypeptide through the sorting signal; 

(c) binding the chimeric protein covalently to the cell wall by the 
enzymatic action of a sortase-transamidase expressed by the Gram-positive 
bacterium involving cleavage of the chimeric protein within the LPX3X4G motif so 
that the polypeptide is displayed on the surface of the Gram-positive bacterium in 

35 such a way that the polypeptide is accessible to a ligand; and 
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(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 

5 85. A method for screening for expression of a cloned polypeptide 

comprising the steps of: 

(a) cloning a nucleic acid segment encoding a chimeric protein into a 
Gram-positive bacterium to generate a cloned chimeric protein including therein a 
carboxyl-terminal sorting signal, the chimeric protein including the polypeptide 

10 whose expression is to be screened, the sorting signal having: (1) a motif of LPX3X4G 
therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids carboxyl to 
the motif; and (3) a charged tail region with at least two positively charged residues 
carboxyl to the substantially hydrophobic domain, at least one of the two positively 
charged residues being arginine, the two positively charged residues being located at 

15 residues 31-33 from the motif, wherein X 3 is any of the twenty naturally-occurring 
L-amino acids and X4 is selected from the group consisting of alanine, serine, and 
threonine; 

(b) growing the bacterium into which the nucleic acid segment has 
been cloned to express the cloned chimeric protein to generate a chimeric protein 

20 including therein a carboxyl-terminal sorting signal; 

(c) binding the polypeptide covalently to the cell wall by the enzymatic 
action of a sortase-transamidase expressed by the Gram-positive bacterium involving 
cleavage of the chimeric protein within the LPX3X4G motif so that the polypeptide is 
displayed on the surface of the Gram-positive bacterium in such a way that the 

25 polypeptide is accessible to a ligand; and 

(d) reacting the displayed polypeptide with a labeled specific binding 
partner to screen the chimeric protein for reactivity with the labeled specific binding 
partner. 



30 86. A method for the diagnosis or treatment of a bacterial infection 

caused by a Gram-positive bacterium comprising the steps of: 

(a) conjugating an antibiotic or a detection reagent to a protein 
including therein a carboxyl-terminal sorting signal to produce a conjugate, the 
carboxyl-terminal sorting signal having: (1) a motif of LPX3X4G therein; (2) a 

35 substantially hydrophobic domain of at least 3 1 amino acids carboxyl to the motif; and 
(3) a charged tail region with at least two positively charged residues carboxyl to the 
substantially hydrophobic domain, at least one of the two positively charged residues 
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being arginine, the two positively charged residues being located at residues 31-33 
from the motif, wherein X 3 is any of the twenty naturally-occurring L-amino acids 
and X 4 is selected from the group consisting of alanine, serine, and threonine; and 

(b) introducing the conjugate to an organism infected with a Gram- 
5 positive bacterium in order to cause the conjugate to be sorted and covalently cross- 
linked to the cell walls of the bacterium in order to treat or diagnose the infection. 

87. The method of claim 86 wherein an antibiotic is conjugated to the 

protein. 

10 

88. The method of claim 87 wherein the antibiotic is selected from the 
group consisting of a penicillin, ampicillin, vancomycin, gentamicin, streptomycin, a 
cephalosporin, amikacin, kanamycin, neomycin, paromomycin, tobramycin, 
ciprofloxacin, clindamycin, rifampin, chloramphenicol, norfloxacin, and a derivative 

1 5 of these antibiotics. 

89. The method of claim 86 wherein a detection reagent is conjugated 

to the protein. 

20 90. A conjugate comprising an antibiotic or a detection reagent 

covalently conjugated to a protein including therein a carboxyl-terminal sorting signal 
to produce a conjugate, the carboxyl-terminal sorting signal having: (1) a motif of 
LPX3X4G therein; (2) a substantially hydrophobic domain of at least 3 1 amino acids 
carboxyl to the motif; and (3) a charged tail region with at least two positively charged 

25 residues carboxyl to the substantially hydrophobic domain, at least one of the two 

positively charged residues being arginine, the two positively charged residues being 
located at residues 31-33 from the motif, wherein X 3 is any of the twenty naturally- 
occurring L-amino acids and X4 is selected from the group consisting of alanine, 
serine, and threonine. 

30 

91 . The conjugate of claim 90 wherein an antibiotic is conjugated to 

the protein. 



35 



92. The conjugate of claim 91 wherein the antibiotic is selected from 
the group consisting of a penicillin, ampicillin, vancomycin, gentamicin, 
streptomycin, a cephalosporin, amikacin, kanamycin, neomycin, paromomycin, 
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tobramycin, ciprofloxacin, clindamycin, rifampin, chloramphenicol, norfloxacin, and 
a derivative of these antibiotics. 

93. The conjugate of claim 90 wherein a detection reagent is 
5 conjugated to the protein. 



94. A composition comprising: 

(a) the conjugate of claim 90; and 

(b) a pharmaceutically acceptable carrier. 

10 

95. A substantially purified protein having at least about 50% match 
with best alignment, in at least one subunit of the protein, with the amino acid 
sequences of at least one of the putative Bacillus peptidase (SEQ ID NO: 3), the 
aminopeptidase P of Lactococcus lactis (SEQ ID NO: 4), or the proline dipeptidase of 

15 Lactobacillus delbrueckii lactis (SEQ ID NO: 5) and having sortase-transamidase 
activity. 



96. The substantially purified protein of claim 95 wherein the match 
with best alignment with the amino acid sequences of at least one of the putative 
20 Bacillus peptidase (SEQ ID NO: 3), the aminopeptidase P of Lactococcus lactis (SEQ 
ID NO: 4), or the proline dipeptidase of Lactobacillus delbrueckii lactis (SEQ ID NO: 
5) is at least about 60%. 



97. The substantially purified protein of claim 96 wherein the match 
25 with best alignment with the amino acid sequences of at least one of the putative 

Bacillus peptidase (SEQ ID NO: 3), the aminopeptidase P of Lactococcus lactis (SEQ 
ID NO: 4), or the proline dipeptidase of Lactobacillus delbrueckii lactis (SEQ ID NO: 
5) is at least about 70%. 



30 98. A substantially purified protein having sortase-transamidase 

activity and a hydrophobicity profile of at least one subunit of the protein, that, 
determined as the mean absolute value of the hydrophobicity difference per residue, 
differs from the hydrophobicity profile of a putative Bacillus peptidase (SEQ ID NO: 
3) by no more than about 2 units on the hydrophobicity scale. 



35 
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99. The substantially purified protein of claim 98 wherein the 
hydrophobicity profile differs from the hydrophobicity profile of the putative Bacillus 
peptidase (SEQ ID NO: 3) by no more than about 1 unit. 

5 100. The substantially purified protein of claim 99 wherein the 

hydrophobicity profile differs from the hydrophobicity profile of the putative Bacillus 
peptidase (SEQ ID NO: 3) by no more than about 0.5 unit. 

101. A nucleic acid sequence encoding the substantially purified 
10 protein of claim 95. 

102. A nucleic acid sequence encoding the substantially purified 
protein of claim 98. 

15 103. A vector comprising the nucleic acid sequence of claim 101 

operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 

104. A vector comprising the nucleic acid sequence of claim 102 
20 operatively linked to at least one control sequence that controls the expression or 

regulation of the nucleic acid sequence. 

105. A host cell transfected with the vector of claim 103. 
25 106. A host cell transfected with the vector of claim 104. 

107. A method for producing a substantially purified protein having 
sortase-transamidase activity comprising the steps of: 

(a) culturing the host cell of claim 105 under conditions in which the 
30 host cell expresses the protein having sortase-transamidase activity; and 

(b) purifying the expressed protein to produce substantially purified 
protein having sortase-transamidase activity. 

108. A method for producing a substantially purified protein having 
35 sortase-transamidase activity comprising the steps of: 

(a) culturing the host cell of claim 106 under conditions in which the 
host cell expresses the protein having sortase-transamidase activity; and 
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(b) purifying the expressed protein to produce substantially purified 
protein having sortase-transamidase activity. 

109. A substantially purified protein having sortase-transamidase 
5 activity and a hydrophobicity profile of at least one subunit of the protein, that, 

determined as the mean absolute value of the hydrophobicity difference per residue, 
differs from the hydrophobicity profile of the sequence of SEQ ID NO: 31 by no more 
than about 2 units on the hydrophobicity scale. 

10 110. The substantially purified protein of claim 1 09 wherein the 

hydrophobicity profile differs from the hydrophobicity profile of the sequence of SEQ 
ID NO: 31 by no more than about 1 unit. 

111. The substantially purified protein of claim 110 wherein the 

15 hydrophobicity profile differs from the hydrophobicity profile of the sequence of SEQ 
ID NO: 31 by no more than about 0.5 unit. 

112. A nucleic acid sequence encoding the substantially purified 
protein of claim 109. 

20 

1 13. A vector comprising the nucleic acid sequence of claim 112 
operatively linked to at least one control sequence that controls the expression or 
regulation of the nucleic acid sequence. 

25 1 14. A host cell transfected with the vector of claim 113. 

115. A method for producing a substantially purified protein having 
sortase-transamidase activity comprising the steps of: 

(a) culturing the host cell of claim 1 14 under conditions in which the 
30 host cell expresses the protein having sortase-transamidase activity; and 

(b) purifying the expressed protein to produce substantially purified 
protein having sortase-transamidase activity. 
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120 

He Lys lie Leu 
135 

His He Leu Thr 
150 

Asn Glu Leu Glu 
165 

Phe Asp Met He 

Val Ala Ser Asp 

200 

Phe Gly Ala Tyr 
215 

Ala Val Gly Gin 
230 

Phe Asp Ala Gin 
245 

Gly Lys Glu Ala 

Tyr Gly Asp Tyr 

280 

Val His Glu Ser 
295 

Pro Gly Met Val 

' 310 
Gly Gly Val Arg 
325 

Arg Thr He Thr 



75 

Arg Leu Gly Phe 
90 

Tyr Ser Ala Val 
105 

Val Glu Lys Leu 

Glu Glu Ala Ala 

140 

Phe Met Lys Pro 
155 

Phe Tyr Met Arg 
170 

Val Ala Ser Gly 
185 

Lys Leu He Glu 

Tyr Lys Gly Tyr 

220 

Pro Ser Asp Gin 
235 

Ala Leu Gly Val 
250 

Asp Ala Leu Thr 
265 

Phe Gly His Ser 

Pro Gly Leu Ser 

300 

Val Thr Val Glu 
315 

He Glu Asp Asp 
330 

His Ser Pro Lys 
345 



80 

Glu Gin Asn Ser 
95 

He Ser Asp Ala 
110 

Arg Leu He Lys 
125 

Lys He Ala Asp 

Gly He Ser Glu 

160 

Ser Gin Gly Ala 
175 

Leu Arg Ser Ser 
190 

Ser Gly Asp Leu 
205 

Cys Ser Asp He 

Leu Lys Glu He 

240 

Ala His He Lys 
255 

Arg Asp His He 
270 

Thr Gly His Gly 
285 

Val Arg Ser Ser 

Pro Gly He Tyr 

320 

lie Val lie Thr 
335 

Glu Leu lie lie 
350 



<210> 4 
<211> 352 
<212> PRT 

<213> Lactococcus Lactis 
<400> 4 



Met 


Arg 


lie 


Glu 


Lys 


Leu 


Lys 


Val 


Lys 


Met 


Leu 


Thr 


Glu 


Asn 


lie 


Asp 


1 








5 










10 










15 
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Leu 


Leu 


lie 


Thr 


Asp 


Met 


Lys 


Asn 


lie 


Phe 


Tyr 


Leu 


Thr 


Gly 


Phe 








20 










25 










30 




Ser 


Gly 


Thr 


Ala 


Gly 


Thr 


Val 


Phe 


Leu 


Thr 


Gin 


Lys 


Arg 


Asn 


lie 


Phe 






35 










40 










45 








Met 


Thr 


Asp 


Ser 


Arg 


Tyr 


Ser 


Glu 


Met 


Ala 


Arg 


Gly 


Leu 


lie 


Lys 


Asn 




50 










55 










60 








Phe 


Glu 


lie 


lie 


Glu 


Thr 


Arg 


Asp 


Pro 


lie 


Ser 


Leu 


Leu 


Thr 


Glu 


Leu 


65 










70 










75 










80 


Ser 


Ala 


Ser 


Glu 


Ser 


Val 


Lys 


Asn 


Met 


Ala 


Phe 


Glu 


Glu 


Thr 


Val 


Asp 










85 










90 










95 


Tyr 


Ala 


Phe 


Phe 


Lys 


Arg 


Leu 


Ser 


Lys 


Ala 


Ala 


Thr 


Lys 


Leu Asp 


Leu 








100 










105 










110 






Phe 


Ser 


Thr 


Ser 


Asn 


Phe 


Val 


Leu 


Glu 


Leu 


Arg 


Gin 


lie 


Lys 


Asp 


Glu 






115 










120 










125 




Ser 


Glu 


lie 


Ser 


Leu 


lie 


Lys 


Lys 


Ala 


Cys 


Glu 


lie 


Ala 


Asp 


Glu 


Ala 




130 










135 










140 








Phe 


Met 


Ser 


Ala 


Leu 


Arg 


Phe 


lie 


Glu 


Pro 


Gly Arg 


Thr 


Glu 


lie 


Glu 


145 










150 










155 










160 


Val 


Ala 


Asn 


Phe 


Leu 


Asp 


Phe 


Lys 


Met 


Arg 


Asp 


Leu 


Glu 


Ala 


Ser 


Gly 










165 










170 










175 


lie 


Ser 


Phe 


Glu 


Thr 


lie 


Val 


Ala 


Ser 


Gly 


Lys 


Arg 


Ser 


Ser 


Leu 


Pro 








180 










185 










190 






His 


Gly 


Val 


Ala 


Thr 


Ser 


Lys 


Met 


lie 


Gin 


Phe 


Gly 


Asp 


Pro 


Val 


Thr 
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10 



15 



20 



25 





195 










200 










205 








He Asp 


Phe 


Gly 


Cys 


Tyr 


Tyr 


Glu 


* m • 

His 


Tyr 


Ala 


Ser Asp Met 


Thr 


Arg 


210 










215 










220 








Thr He 


Phe 


Val 


Gly 


Ser 


Val 


Asp 


Asp 


Lys 


Met 


Arg 


Thr 


He 


Tyr 


Glu 


225 








230 










235 








240 


Thr Val 


Arg 


Lys 


Ala 


Asn 


Glu 


Ala 


Leu 


He 


Lys 


Gin 


Val 


Lys 


Ala 


Gly 








245 










250 








255 


Met Thr 


Tyr 


m, 

Ala 


Gin 


Tyr 


Asp 


Asn 


He 


Pro 


Arg 


Glu 


Val 


He 


Glu 


Lys 






260 










265 










270 




Ala Asp 


Phe 


Gly 


Gin 


Tyr 


Phe 


Thr 


His 


Gly 


He 


Gly 


His 


Gly 


Leu 


Gly 




275 










280 










285 




Leu Asp 


Val 


His 


Glu 


He 


Pro 


Tyr 


Phe 


Asn 


Gin 


Ser 


Met 


Thr 


Glu 


Asn 


290 










295 










300 










Gin Leu 


Arg 


Ser 


Gly 


Met 


Val 


He 


Thr 


Asp 


Glu 


Pro 


Gly 


He 


Tyr 


Leu 


305 








310 










315 






320 


Pro Glu 


Phe 


Gly 


Gly 
325 


Val 


Arg 


He 


Glu 


Asp 
330 


Asp 


Leu 


Leu 


Val 


Thr 
335 


Glu 


Asn Gly 


Cys 


Glu 
340 


Val 


Leu 


Thr 


Lys 


Ala 
345 


Pro 


Lys 


Glu 


Leu 


He 
350 


Val 


He 



<210> 5 
<211> 368 
<212> PRT 

<213> Lactobacillus Delbrueckii Lactis 







<400> 


5 




























Met 


Asn 


Leu 


Asp 


Lys 


Leu 


Gin 


Asn 


Trp 


Leu 


Gin 


Glu 


Asn 


Gly 


Met 


Asp 




1 


> 






5 










10 








15 


30 


Val 


Ala 


Tyr 


Val 


Ser 


Ser 


Pro 


Thr 


Thr 


He 


Asn 


Tyr 


Phe 


Thr 


Gly 


Phe 








20 










25 








30 






He 


Thr 


Asp 


Pro 


Glu 


Glu 


Arg 


lie 


Phe 


Lys 


Leu 


Phe 


Ala 


Phe 


Lys 


Asp 








35 










40 










45 






Ala 


Glu 


Pro 


Phe 


Leu 


Phe 


Cys 


Pro 


Ala 


Leu 


Asn 


Tyr 


Glu 


Glu 


Ala 


Lys 






50 










55 










60 








35 


Ala 
65 


Ser 


Ala 


Trp 


Asp 


Gly 
70 


Asp 


Val 


Val 


Gly 


Tyr 
75 


Leu 


Asp 


Ser 


Glu 


Asp 
80 




Pro 


Trp 


Ser 


Lys 


He 
85 


Ala 


Glu 


Glu 


lie 


Lys 
90 


Lys 


Arg 


Thr 


Lys 


Asp 
95 


Tyr 




Gin 


Asn 


Trp 


Ala 


Val 


Glu 


Lys 


Asn 


Gly 


Leu 


Thr 


Val 


Ala 


His 


Tyr 


Gin 


40 








100 










105 










110 






Ala 


Leu 


His 


Ala 


Gin 


Phe 


Pro 


Asp 


Ser 


Asp 


Phe 


Ser 


Lys 


Asp 


Leu 


Ser 








115 










120 










125 








Asp 


Phe 
130 


He 


Ala 


His 


He 


Arg 
135 


Leu 


Phe 


Lys 


Thr 


Glu 
140 


Ser 


Glu 


Leu 


Val 


45 


Lys 
145 


Leu 


Arg 


Lys 


Ala 


Gly 
150 


Glu 


Glu 


Ala 


Asp 


Phe 
155 


Ala 


Phe 


Gin 


He 


Gly 
160 




Phe 


Glu 


Ala 


Leu 


Arg 
165 


Asn 


Gly 


Val 


Thr 


Glu 
170 


Arg 


Ala 


Val 


Val 


Ser 
175 


Gin 




He 


Glu 


Tyr 


Gin 


Leu 


Lys 


Leu 


Gin 


Lys 


Gly 


Val 


Met 


Gin 


Thr 


Ser 


Phe 


50 








180 










185 










190 








Asp 


Thr 


He 


Val 


Gin 


Ala 


Gly 


Lys 


Asn 


Ala 


Ala 


Asn 


Pro 


His 


Gin 


Gly 








195 










200 










205 








Pro 


Ser 


Met 


Asn 


Thr 


Val 


Gin 


Pro 


Asn 


Glu 


Leu 


Val 


Leu 


Phe 


Asp 


Leu 






210 










215 










220 








55 


Gly 
225 


Thr 


Met 


His 


Glu 


Gly 
230 


Tyr 


Ala 


Ser 


Asp 


Ser 
235 


Ser 


Arg 


Thr 


Val 


Ala 
240 




Tyr 


Gly 


Glu 


Pro 


Thr 
245 


Asp 


Lys 


Met 


Arg 


Glu 
250 


lie 


Tyr 


Glu 


Val 


Asn 
255 


Arg 




Thr 


Ala 


Gin 


Gin 


Ala 


Ala 


lie 


Asp 


Ala 


Ala 


Lys 


Pro 


Gly 


Met 


Thr 


Ala 


60 








260 










265 








270 








Ser 


Glu 


Leu 
275 


Asp 


Gly 


Val 


Ala 


Arg 
280 


Lys 


lie 


lie 


Thr 


Asp 
285 


Ala 


Gly 


Tyr 




Gly 


Glu 


Tyr 


Phe 


lie 


His 


Arg 


Leu 


Gly 


His 


Gly 


lie 


Gly Met 


Glu 


Val 






290 










295 










300 










65 


His 
305 


Glu 


Phe 


Pro 


Ser 


lie 
310 


Ala 


Asn 


Gly 


Asn 


Asp 
315 


Val 


Val 


Leu 


Glu 


Glu 
320 




Gly 


Met 


Cys 


Phe 


Ser 
325 


He 


Glu 


Pro 


Gly 


lie 
330 


Tyr 


lie 


Pro 


Gly 


Phe 
335 


Ala 




Gly 


Val 


Arg 


lie 


Glu 


Asp 


Cys 


Gly 


Val 


Leu 


Thr 


Lys 


Asp 


Gly 


Phe 


Lys 
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340 345 350 

Pro Phe Thr His Thr Ser Lys Glu Leu Lys Val Leu Pro Val Lys Glu 
355 360 365 

5 <210> 6 

<211> 25 
<212> PRT 

<213> Staphylococcus Aureus 

10 <400> 6 

Glu Glu Asn Pro Phe lie Gly Thr Thr Val Phe Gly Gly Leu Ser Leu 

1 5 10 ^ 15 

Ala Leu Gly Ala Ala Leu Leu Ala Gly 

20 25 



15 



20 



<210> 7 
<211> 23 
<212> PRT 

<213> Staphylococcus Aureus 



<400> 7 

Gly Glu Glu Ser Thr Asn Lys Gly Met Leu Phe Gly Gly Leu Phe Ser 

15 10 " J 15 

lie Leu Gly Leu Ala Leu Leu 
25 20 

<210> 8 
<211> 24 
<212> PRT 

30 <213> Staphylococcus Sobrinos 

<400> 8 

Asp Ser Ser Asn Ala Tyr Leu Pro Leu Leu Gly Leu Val Ser Leu Thr 
15 10 15 

35 Ala Gly Phe Ser Leu Leu Gly Leu 

20 

<210> 9 
<211> 24 
40 <212> PRT 

<213> Enterococcus Faecalis 

<400> 9 

Glu Lys Gin Asn Val Leu Leu Thr Val Val Gly Ser Leu Ala Ala Met 
45 1 5 10 15 

Leu Gly Leu Ala Gly Leu Gly Phe 

20 

<210> 10 
50 <211> 23 

<212> PRT 

<213> Streptococcus Pyogenes 
<400> 10 

55 Ser lie Gly Thr Tyr Leu Phe Lys lie Gly Ser Ala Ala Met lie Gly 

15 10 15 

Ala He Gly He Tyr He Val 

20 

60 <210> 11 

<211> 22 
<212> PRT 

<213> Listeria Monocytogenes 

65 <400> 11 

Asp Ser Asp Asn Ala Leu Tyr Leu Leu Leu Gly Leu Leu Ala Val Gly 

1 5 10 15 

Thr Ala Met Ala Leu Thr 

20 



WO 99/09145 
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10 



15 



<210> 12 
<211> 5 
<212> PRT 

<213> Staphylococcus Aureus 

<400> 12 
Arg Arg Arg Glu Leu 

1 5 

<210> 13 

<211> 9 

<212> PRT 

<213> Staphylococcus Aureus 

<400> 13 

Arg Arg Asn Lys Lys Asn His Lys Ala 
1 5 

20 <210> 14 

<211> 5 

<212> PRT 

<213> Staphylococcus Sobrinus 

25 <400> 14 

Arg Arg Lys Gin Asp 
1 5 

<210> 15 
30 <211> 7 

<212> PRT 

<213> Enterococcus Faecalis 
<400> 15 

35 Lys Arg Arg Lys Glu Thr Lys 
1 5 

<210> 16 
<211> 5 
40 <212> PRT 

<213> Streptococcus Pyogenes 

<400> 16 
Lys Arg Arg Lys Ala 
45 1 5 

<210> 17 
<211> 8 
<212> PRT 

50 <213> Actinomyces Viscosus 

<400> 17 
Lys Arg Arg His Val Ala Lys His 
1 5 

55 

<210> 18 
<211> 5 
<212> PRT 

<213> Streptococcus Aglactiae 

<400> 18 
Lys Arg Arg Lys Ser 
1 ~ 5 

65 <210> 19 

<211> 6 
<212> PRT 

<213> Streptococcus Pyogenes 



60 
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<400> 19 
Lys Arg Lys Glu Glu Asn 
1 5 

5 <210> 20 

<211> 5 
<212> PRT 

<213> Artificial Sequence 

10 <220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 20 
Arg Arg Arg Glu Ser 
15 1 5 

<210> 21 
<211> 5 
<212> PRT 

20 <213> Artificial Sequence 

<220> 

<223> Mutated derived from Streptococcus Pyogenes 

25 <400> 21 

Arg Arg Arg Ser Leu 
1 5 

<210> 22 

30 <211> 5 

<212> PRT 

<213> Artificial Sequence 
<220> 

35 <223> Mutated derived from Streptococcus Pyogenes 

<400> 22 
Arg Arg Ser Glu Leu 
1 5 



40 



45 



60 



<210> 23 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Mutated derived from Streptococcus Pyogenes 



<400> 23 

50 Arg Ser Arg Glu Leu 

1 5 

<210> 24 

<211> 5 

55 <212> PRT 

<213> Artificial Sequence 



<220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 24 

Ser Arg Arg Glu Leu 
1 5 

65 <210> 25 

<211> 5 

<212> PRT 

<213> Artificial Sequence 



WO 99/09145 



PGT/US98/16229 



-7- 

<220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 25 
5 Arg Arg Ser Ser Ser 
1 5 

<210> 26 
<211> 5 
10 <212> PRT 

<213> Artificial Sequence 



15 



<220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 26 
Arg Ser Arg Ser Ser 
1 5 

20 <210> 27 

<211> 5 
<212> PRT 

<213> Artificial Sequence 

25 <220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 27 
Ser Arg Arg Ser Ser 
30 1 5 

<210> 28 
<211> 1017 
<212> DNA 

35 <213> Staphylococcus Aureus 

<220> 
<221> CDS 

<222> (716) . . . (1014) 

40 

<400> 28 

cgactctaga ggacaagcaa ctaagcaggc gccaaattat gaaattatta atcgtaaatc 60 
tactattatt ggtgagatta aaaaattgct acnccaanaa aattttgaaa atgttngttt 120 
tgaggggcat catgtnnttt atgatacata ccttgaatta aataaaagcc gtatatcatt 180 

45 aataagcatt tctaatactg tagataaaat tagagacgtc caagatgctg acgaaattgc 240 
tttaattcaa aaagcagcta atattgttga tgaaacatat gaatatattt taactgttgt 300 
aaaagcnngc atgactgaaa aanaattaaa ggcaatatta gaaagccaaa tgctanaatt 360 
aggagcagat ggaccntctt tcgatacgat tgtngcatct ggtcctagag gtgcattacc 420 
accatggtgt tgcaagtgat aaaattattg aaaaaggcga catgattaca ttanattttg 4 80 

50 ggcgcnttnt tataaccggc tattgttcaa attttactan aacattttgc nattgggaaa 54 0 
aacccgatcc ntaaactgaa aagaaattgt agcatctgtt catagaggtg cattaccaca 600 
tggtgttgca agtgataaaa ttattgaaaa aggcgacatg atnacattag atttnggcgc 660 
gtattataac ggctattgtt cagatattac tagaacattt gctattggag aacca gat 718 

Asp 

55 1 

cct aaa ctg aaa gaa ata tat caa ata gta ctt gaa tct caa atg aaa 766 
Pro Lys Leu Lys Glu lie Tyr Gin lie Val Leu Glu Ser Gin Met Lys 

5 10 15 

gca att aat gag att aga cct ggc atg act ggt gca gaa get gat gec 814 
Ala lie Asn Glu lie Arg Pro Gly Met Thr Gly Ala Glu Ala Asp Ala 
20 25 30 

65 att tea aga aac tat tta gag tea aaa ggg tat gga aaa gaa ttt gga 862 
lie Ser Arg Asn Tyr Leu Glu Ser Lys Gly Tyr Gly Lys Glu Phe Gly 
35 40 45 



60 



cat tea eta gga cat ggt att ggt tta gaa ate cat gaa ggg cca atg 



910 
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75 
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gtt 
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ata 
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ata 
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Val 


Xaa Pro 


Gly 


Val 


Xaa 


He 


Glu 


Gly 


Leu Gly 


Gly 


He 


Arg 


He 


Glu 


10 






85 










90 










95 








gag 


ata tt taa 




























Glu 


lie 
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29 






























<211> 


100 






























<212> 


PRT 


























20 




<213> 


Staphylococcus Aureus 




















<400> 


29 




























Asp 


Pro Lys 


Leu 


Lys 


Glu 


He 


Tyr 


Gin 


He 


Val 


Leu 


Glu 


Ser 


Gin 


Met 




1 






5 










10 










15 




25 


Lys 


Ala He 


Asn 


Glu 


He 


Arg 


Pro 


Gly Met 


Thr 


Gly Ala 


Glu 


Ala 


Asp 








20 










25 










30 






AT a 


He Ser 


Arg 


Asn 


Tyr 


Leu 


Glu 


Ser 


Lys 


Glv 


Tyr 


Gly 


T, wo 


O -L Li 








35 










40 










45 










Gly 


His Ser 


Leu 


Gly 


His 


Gly 


He 


Gly 


Leu 


Glu 


He 


His 


Glu 


Gly 


Pro 


30 




50 








55 










Ou 










Met 


Leu Xaa 


Arg 


Thr 


He 


Gin 


Asp 


Lys 


T "i *t 
Leu 


Gin 


Val 


7\ o 


Asn 


Cys 


Val 




65 








70 










75 










80 




Thr 


Val Xaa 


Pro 


Gly 


Val 


Xaa 


He 


Glu 


Gly 


Leu 


Gly 


Gly 


He 


Arg 


He 










85 










90 










95 




35 


Glu 


Glu He 


Phe 
































100 






























<210> 


30 






























<Z 1 1> 


360 


























40 




<212> 


DNA 






























<213> 


Staphlococcus Aureus 






















































CDS 


























45 




<222> 


(1) - 


. . (357) 


























<400> 


30 




























atg 


gtc aaa 


gta 


act 


gat 


tat 


tea 


aat 


tea 


aaa 


tta 


ggt 


aaa 


gta 


gaa 




Met 


Val Lys 


Val 


Thr 


Asp 


Tyr 


Ser 


Asn 


Ser 


Lys 


Leu 


Gly 


Lys 


Val 


Glu 


50 


1 






5 










10 










15 






ata 


gcg cca 


gaa 


gtg 


eta 


tct 


gtt 


att 


gca 


agt 


ata 


get 


act 


teg 


gaa 




lie 


Ala Pro 


Glu 


Val 


Leu 


Ser 


Val 


He 


Ala 


Ser 


He 


Ala 


Thr 


Ser 


Glu 








20 










25 










30 






55 


































gtc 


gaa ggc 


ate 


act 


ggc 


cat 


ttt 


get 


gaa 


tta 


aaa 


gaa 


aca 


aat 


tta 




Val 


Glu Gly 


He 


Thr 


Gly 


His 


Phe 


Ala 


Glu 


Leu 


Lys 


Glu 


Thr 


Asn 


Leu 






35 










40 










45 








60 


gaa 


aaa gtt 


agt 


cgt 


aaa 


aat 


tta 


age 


cgt 


gat 


tta 


aaa 


ate 


gag 


agt 




Glu 


Lys Val 


Ser 


Arg 


Lys 


Asn 


Leu 


Ser 


Arg 


Asp 


Leu 


Lys 


He 


Glu 


Ser 






50 








55 










60 












aaa 


gaa gat 


ggc 


ata 


tat 


ata 


gat 


gta 


tat 


tgt 


gca 


tta 


aaa 


cat 


ggt 


65 


Lys 


Glu Asp 


Gly 


He 


Tyr 


He 


Asp 


Val 


Tyr 


Cys 


Ala 


Leu 


Lys 


His 


Gly 




65 








70 










75 










80 



958 



1006 



1017 



48 



96 



144 



192 



240 



aat att tea aaa act gca aac aaa att caa acg tea att ttt aat tea 
Asn He Ser Lys Thr Ala Asn Lys He Gin Thr Ser He Phe Asn Ser 



288 
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85 90 95 

att tct aat atg aca gcg ata gaa cct aag caa att aat att cac att 336 
lie Ser Asn Met Thr Ala lie Glu Pro Lys Gin lie Asn lie His lie 
5 100 105 110 

aca caa ate gtt att gaa aag taa 360 
Thr Gin lie Val He Glu Lys 
115 

10 

<210> 31 
<211> 119 
<212> PRT 

15 <213> Staphylococcus Aureus 



<400> 31 
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Val 


Lys 


Val 


Thr 


Asp 


Tyr 


Ser 


Asn 


Ser 


Lys 


Leu 


Gly 


Lys 


Val 


Glu 




1 








5 










10 










15 




20 


He 


Ala 


Pro 


Glu 


Val 


Leu 


Ser 


Val 


He 


Ala 


Ser 


He 


Ala 


Thr 


Ser 


Glu 










20 










25 










30 








Val 


Glu 


Gly 


lie 


Thr 


Gly 


His 


Phe 


Ala 


Glu 


Leu 


Lys 


Glu 


Thr 


Asn 


Leu 








35 










40 










45 










Glu 


Lys 


Val 


Ser 


Arg 


Lys 


Asn 


Leu 


Ser 


Arg 


Asp 


Leu 


Lys 


He 


Glu 


Ser 


25 




50 










55 










60 












Lys 


Glu 


Asp 


Gly 


He 


Tyr 


He 


Asp 


Val 


Tyr 


Cys 


Ala 


Leu 


Lys 


His 


Gly 




65 










70 










75 










80 




Asn 


He 


Ser 


Lys 


Thr 


Ala 


Asn 


Lys 


He 


Gin 


Thr 


Ser 


He 


Phe 


Asn 


Ser 












85 










90 










95 




30 


He 


Ser 


Asn 


Met 


Thr 


Ala 


He 


Glu 


Pro 


Lys 


Gin 


He 


Asn 


He 


His 


He 










100 










105 










110 








Thr 


Gin 


He 


Val 


He 


Glu 


Lys 





















115 



35 <210> 32 

<211> 19 
<212> PRT 

<213> Artificial Sequence 

40 <220> 

<223> Mutated derived from Streptococcus Pyogenes 

<400> 32 

His His His His His His Ala Gin Ala Leu Glu Pro Thr Gly Glu Glu 
45 1 5 10 15 

Asn Pro Phe 



<210> 33 
50 <211> 1119 

<212> DNA 

<213> Streptococcus Pyogenes 
<220> 

55 <221> CDS 

<222> (1) . . . (995) 

<400> 33 

atg eta caa tat tct caa aag tta cca aag gag ttc gcg atg tea gga 48 

60 Met Leu Gin Tyr Ser Gin Lys Leu Pro Lys Glu Phe Ala Met Ser Gly 

15 10 15 

ttt tta gaa caa cga tta ggt cac tgc eta agg cag atg gca gag aag 96 
Phe Leu Glu Gin Arg Leu Gly His Cys Leu Arg Gin Met Ala Glu Lys 
65 20 25 30 

ggg eta gag get ctt eta gtc ace cat tta ace aat agt tat tac ttg 144 
Gly Leu Glu Ala Leu Leu Val Thr His Leu Thr Asn Ser Tyr Tyr Leu 
35 40 45 



WO 99/09145 



PCT/US98/16229 



25 
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aca ggt ttt tct gga act gca gca act gtt ttg ata acg gcc aaa cgt 192 
Thr Gly Phe Ser Gly Thr Ala Ala Thr Val Leu He Thr Ala Lys Arg 
50 55 60 

5 

cgt gtt ttg ate aca gat tea cgt tat acc ttg ctt get aaa get agt 240 
Arg Val Leu He Thr Asp Ser Arg Tyr Thr Leu Leu Ala Lys Ala Ser 
65 70 75 80 

10 gtt gag gga ttt gat att ate gaa age cgc acg ccg ctt aag gtt gtg 288 
Val Glu Gly Phe Asp He He Glu Ser Arg Thr Pro Leu Lys Val Val 

85 90 95 

gca gaa ttg tta gag get gat caa ata gat tgc ctt ggt ttt gag gac 336 
15 Ala Glu Leu Leu Glu Ala Asp Gin He Asp Cys Leu Gly Phe Glu Asp 

100 105 110 

cag gta teg ttt tct ttt tac cag gcc atg caa gca gaa ctg tea gga 384 
Gin Val Ser Phe Ser Phe Tyr Gin Ala Met Gin Ala Glu Leu Ser Gly 
20 115 120 125 

ata acc ttg ctt get cag tea ggt ttt gtg gag cat tta cgt ctt att 432 
He Thr Leu Leu Ala Gin Ser Gly Phe Val Glu His Leu Arg Leu He 
130 135 140 

aag gac gcc tct gaa ate gat acc att get aaa gcg tgc teg ate tea 480 
Lys Asp Ala Ser Glu He Asp Thr He Ala Lys Ala Cys Ser He Ser 
145 150 155 160 

30 gac aaa gca ttt gaa gat get ctt gat ttt att aaa cca ggg aca acc 528 
Asp Lys Ala Phe Glu Asp Ala Leu Asp Phe He Lys Pro Gly Thr Thr 

165 170 175 

act gaa cgt gac ctg get aat ttt tta gat ttt cgt atg cgt cag tat 576 
35 Thr Glu Arg Asp Leu Ala Asn Phe Leu Asp Phe Arg Met Arg Gin Tyr 

180 185 " 190 

ggt gcc age ggc aca tea ttt gat ate att gta get tea ggc tat etc 624 
Gly Ala Ser Gly Thr Ser Phe Asp He He Val Ala Ser Gly Tyr Leu 
40 195 200 205 

tct gcc atg cct cat gga cgc gcc agt gac aag gtt ate cag aat aaa 672 

Ser Ala Met Pro His Gly Arg Ala Ser Asp Lys Val He Gin Asn Lys 

210 215 220 

45 

gag age ttg acc atg gac ttt ggg tgt tac tac aat cac tat gtt agt 720 

Glu Ser Leu Thr Met Asp Phe Gly Cys Tyr Tyr Asn His Tyr Val Ser 

225 230 235 240 

50 gat atg acg agg acc att cat att cat att ggc caa gtt act gat gaa 7 68 

Asp Met Thr Arg Thr He His He His He Gly Gin Val Thr Asp Glu 

245 250 255 

gaa cgt gag att tat get ctt gtt ctt get get aat aag get tta att 816 
55 Glu Arg Glu He Tyr Ala Leu Val Leu Ala Ala Asn Lys Ala Leu lie 

260 265 270 

get aaa get age get ggc atg act tat agt gac ttt gac ggt att ccg 864 
Ala Lys Ala Ser Ala Gly Met Thr Tyr Ser Asp Phe Asp Gly He Pro 
60 275 280 285 

cgc caa etc ate act gag gcg ggt tat ggc agt cgc ttc aca cat ggc 912 
Arg Gin Leu He Thr Glu Ala Gly Tyr Gly Ser Arg Phe Thr His Gly 
290 295 300 



65 



att ggt cat ggc ate ggg ctt gac ate cat gag aat cca ttt ttt ggg 960 
He Gly His Gly He Gly Leu Asp He His Glu Asn Pro Phe Phe Gly 
305 310 * 315 320 
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aaa tct gag caa ctt etc caa get gga atg gtg gt aacagatgag 1005 
Lys Ser Glu Gin Leu Leu Gin Ala Gly Met Val 

325 330 

5 ccaggtatct atttggataa caaatatggt gtccgtattg aagatgactt ggttatcaca 1065 
aaactggctt gtcaagtctt gaccttggca cccaaagaat taattgtatt gtaa 1119 

<210> 34 
<211> 332 
10 <212> PRT 

<213> Streptococcus Pyogenes 



<400> 34 





Met 


Leu 


Gin 


Tyr 


Ser 


Gin 


Lys 


Leu 


Pro 


Lys 


Glu 


Phe 


Ala 


Met 


Ser 


Gly 


1 c 

15 


1 








5 










10 










15 




Phe 


Leu 


Glu 


Gin 


Arg 


Leu 


Gly 


His 


Cys 


Leu 


Arg 


Gin 


Met 


Ala 


Glu 


Lys 










20 










25 










30 






Gly 


Leu 


Glu 


Ala 


Leu 


Leu 


Val 


Thr 


His 


Leu 


Thr 


Asn 


Ser 


Tyr 


Tyr 


Leu 








35 










40 










45 




20 


Thr 


Gly 


Phe 


Ser 


Gly 


Thr 


Ala 


Ala 


Thr 


Val 


Leu 


He 


Thr 


Ala 


Lys 


Arg 






50 










55 










60 








Arg 


Val 


Leu 


He 


Thr 


Asp 


Ser 


Arg 


Tyr 


Thr 


Leu 


Leu 


Ala 


Lys 


Ala 


Ser 




65 










70 










75 








80 




Val 


Glu 


Gly 


Phe 


Asp 


He 


He 


Glu 


Ser 


Arg 


Thr 


Pro 


Leu 


Lys 


Val 


Val 


25 










85 










90 








95 






Ala 


Glu 


Leu 


Leu 


Glu 


Ala 


Asp 


Gin 


He 


Asp 


Cys 


Leu 


Gly 


Phe 


Glu 


Asp 










100 










105 










110 






Gin 


Val 


Ser 


Phe 


Ser 


Phe 


Tyr 


Gin 


Ala 


Met 


Gin 


Ala 


Glu 


Leu 


Ser 


Gly 








115 










120 










125 








lie 


Thr 


IT 

Leu 


Leu 


Ala 


Gin 


Ser 


Gly 


Phe 


Val 


Glu 


His 


Leu 


Arg 


Leu 


He 






130 










135 










140 










Lys 


Asp 


Ala 


Ser 


GlU 


lie 


Asp 


1 nr 


He 


Ala 


■r _ _ _ 

Lys 


Ala 


Cys 


Ser 


He 


Ser 




145 










150 










155 










160 




Asp 


Lys 


Ala 


Phe 


Glu 


Asp 


Ala 


Leu 


Asp 


Phe 


He 


Lys 


Pro 


Gly 


Thr 


Thr 


35 










165 










170 








175 






Thr 


Glu 


Arg 


Asp 
180 


Leu 


Ala 


Asn 


Phe 


Leu 
185 


Asp 


Phe 


Arg 


Met 


Arg 
190 


Gin 


Tyr 




Gly 


Ala 


Ser 


Gly 


Thr 


Ser 


Phe 


Asp 


He 


He 


Val 


Ala 


Ser 


Gly 


Tyr 


Leu 








195 










200 










205 






40 


Ser 


Ala 


Met 


Pro 


His 


Gly 


Arg 


Ala 


Ser 


Asp 


Lys 


Val 


He 


Gin 


Asn 


Lys 






210 










215 










220 










Glu 


Ser 


Leu 


Thr 


Met 


Asp 


Phe 


Gly 


Cys 


Tyr 


Tyr 


Asn 


His 


Tyr 


Val 


Ser 




225 










230 










235 










240 




Asp 


Met 


Thr 


Arg 


Thr 


He 


His 


He 


His 


He 


Gly 


Gin 


Val 


Thr 


Asp 


Glu 


45 










245 










250 










255 






Glu 


Arg 


Glu 


He 
260 


Tyr 


Ala 


Leu 


Val 


Leu 
265 


Ala 


Ala 


Asn 


Lys 


Ala 
270 


Leu 


He 




Ala 


Lys 


Ala 
275 


Ser 


Ala 


Gly 


Met 


Thr 
280 


Tyr 


Ser 


Asp 


Phe 


Asp 
285 


Gly 


He 


Pro 


50 


Arg 


Gin 
290 


Leu 


He 


Thr 


Glu 


Ala 
295 


Gly 


Tyr 


Gly 


Ser 


Arg 
300 


Phe 


Thr 


His 


Gly 




He 


Gly 


His 


Gly 


He 


Gly 


Leu 


Asp 


He 


His 


Glu 


Asn 


Pro 


Phe 


Phe 


Gly 




305 










310 










315 










320 




Lys 


Ser 


Glu 


Gin 


Leu 


Leu 


Gin 


Ala 


Gly 


Met 


Val 


Val 











55 325 330 
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